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DO ALUMNI AND STUDENTS DIFFER IN THEIR 
ATTITUDES TOWARD INSTRUCTORS? 


A. J. DRUCKER AND H. H. REMMERS* 


Purdue University 


Among early comments concerning the rating of instructors by 
students was the criticism that ‘student judgment may be unripe 
and different from later judgment...’ Since the time that 
statement and others like it were made, the influence of maturity 
within undergraduate groups upon ratings of instructors, using 
the Purdue Rating Scale for Instructors, has been investigated. 
Comparisons of the ratings of freshmen, sophomores, juniors and 
seniors showed slight differences in ratings of some of the traits, 
but these were ‘“‘relatively unimportant as compared with the 
resemblances among the four classes.’”® 

The current investigation was undertaken to test the hypothe- 
sis that students’ ideas of what constitutes a good instructor 
change as the students grow older, by comparing student and 
alumni judgments of the same traits and instructors. The 
purpose of the Purdue Rating Scale for Instructors is to measure 
the student’s current judgments, or prejudices, as an important 
aspect of the teacher-student relationship, important as a vital 
factor in determining the learning process. It has been stipu- 
lated that the teacher who uses this scale must constantly bear 
in mind that the scale is primarily a device for ascertaining 
student judgment concerning the traits in question. Regardless 
of what the teacher believes or knows about himself with respect 
to these traits, the student attitude exists, and exists as a func- 
tioning factor in the classroom procedure.* Results of this 
investigation hence cannot be expected to alter the basic purposes 





* Dr. N. L. Gage gave major assistance in the design of the experiment. 
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or aims of the Purdue Rating Scale for Instructors. If, however, 
it can be shown that students’ judgments of the same instructors 
do not actually change with the years, perhaps more instructors 
will avail themselves of the instrument with increased confidence 
and reliance in the judgments of the undergraduates they teach. 

To test the above hypothesis, two types of comparisons were 
to be made: 1) a comparison of the attitudes of contemporary 
undergraduates toward college instructors with the attitudes of 
alumni of ten or more years’ standing toward the same instruc- 
tors; 2) a comparison of the desiderata of good college instructors 
as conceived by contemporary undergraduates and by alumni of 
ten or more years’ standing. By ‘attitudes toward instructors’ 
we mean ratings by students and alumni of their instructors on 
the Purdue Rating Scale for Instructors. By ‘Instructors’ we 
mean members of the instructional staff for undergraduates at 
Purdue University. By ‘desiderata of good instructors’ we 
mean ratings as to importance for good instruction of a series of 
traits of instructors. 

PROCEDURE 


Since a follow-up study to test our hypothesis was not feasible, 
it was necessary to obtain the judgments of Purdue University 
graduates of the years of 1936 to 1939 and compare them with 
the judgments of current students of the same curricula. In 
addition it was necessary to use in the study instructors who 
were on the staff in 1936 as well as now, and who are still teaching 
the same courses. The procedure involved in obtaining ratings 
was different in at least two respects from that usually involved 
in obtaining ratings: 1) one to five ratings were to be made on a 
single scale, whereas in the past only a single instructor was 
rated on a scale; 2) the ratings were to be made outside the class- 
room with the permission of the instructors. Since these differ- 
ences in procedure were alike for students and alumni making the 
ratings, they were not expected to have any effect upon any 
comparisons of ratings by students and alumni. The fact that 
classroom time was not being used for making the ratings may 
have had the effect of obtaining ratings on a more representative 
group of instructors than hitherto has been obtained, since in the 
past only those instructors who both desired the ratings and were 
willing to take class time for them have been rated. 

Selecting Instructors.—A list was made of all instructors appear- 
ing in both the Purdue University Catalogue of 1936 and in the 
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1948-1949 Catalogue. A letter was then sent to each instructor 
stating the purpose of the investigation and asking the participa- 
tion of the instructor whose réle would be purely a passive one— 
that of being rated by alumni and students. Of one hundred 
thirty-nine instructors of whom participation was requested, one 
hundred two consented to being rated. Of the thirty-seven who 
did not give consent, several had no classes of undergraduate 
students for the current year. 

Selecting Alumni and Students ——To maximize the number of 
ratings for the instructors participating in the study, it was 
decided to select alumni and students from schools and depart- 
ments in proportion to the representation of eligible instructors. 
This meant that the students were not selected as strictly repre- 
sentative of the total University student enrollment. It was 
felt, however, that this was not a major consideration. In a 
previous study at Purdue University it was found that student 
ratings were unaffected by a number of investigated factors. 
Freshmen rated their instructors no higher and no lower than 
did seniors, male students rated their instructors no differently 
than did female students, veteran students rated their instruc- 
tors similarly to non-veterans, and students in the upper half of 
the class rated their instructors like those in the lower half.’ 

At Brooklyn College, in an extensive teacher-rating project 
the evidence indicated that the ratings given to an instructor 
were not affected by the student’s college class, sex or by the fact 
that the course he taught was an elective or prescribed course.’ 

It was decided to send Rating Scales and instructions to five 
hundred students and to five hundred alumni in the proportions 
indicated in Table 1 under per cent of eligible instructors. 
Within schools the sample of students selected was equalized by 
classes, equal numbers of freshmen, sophomores, juniors and 
seniors being chosen. For example, in Agriculture, eighteen per 
cent of the total sample of students or ninety students were to be 
chosen. This resolved into twenty-two freshmen, twenty-three 
sophomores, twenty-two juniors and twenty-three seniors from 
the School of Agriculture. In Engineering, since freshmen are 
not designated by Engineering department, the entire quota of 
Engineering students was divided by four to indicate the number 
of freshmen engineers to select. Then the sophomores, juniors 
and seniors were divided according to the quotas of their respec- 
tive engineering schools. 
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TABLE 1.—CoMPOSITION OF THE SAMPLE OF STUDENTS BY SCHOOL 
MADE TO CONFORM TO THE PER CENT OF ELIGIBLE INSTRUCTORS 


Per cent Number Per cent 
Eligible of Approximate 
Instruc- Students University 


School or Department tors Selected Enrollment 
Agricultural Engineering 2 8* l 
Chemical Engineering 3 15 8 
Civil Engineering 9 34 6 
Electrical Engineering 6 22 13 
Air Transportation, Mechan- 

ical and Aeronautical Engi- 

neering 6 22 24 
Freshman Engineering 34 
Agriculture 18 90 10 
Forestry 2 10 2 
Home Economics 6 30 9 
Pharmacy 4 20 4 
Physical Education 4 12 3 
Science 38 198 18 
Trade & Industrial Education l 5 2 
Totals 100 500 100 


* Numbers in departmentalized engineering groups are one-fourth less 
because of the allowance for freshman engineers. 


Students were selected for participation at random with the 
help of the current student directory. First the elements of the 
student sample were listed according to class and curriculum. 
Then for a given school and a given class, the total number of 
pages in the directory was divided by the number of students 
in that element. Every Qth page was then designated as a page 
for selecting one student for the given school and the given class. 
The rule was: select the first student whose name appears on that 
page who satisfies the class and school requirement. 

In selecting like alumni for participation in the study, the 
Bulletin of Purdue University for 1936-37 was used, since it listed 
the names of students by class and school enrolled for that year. 
Using the same system of selecting alumni as was used for select- 
ing students, the problem of obtaining the proper number of 
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alumni drop-outs at various stages of their academic careers to 
make the student and alumni samples comparable did not have 
to be met. We simply assumed the students in attendance in 
1936-37 were similar in ability to students in attendance in 
1948-49. The Alumni Association provided addresses for the 
alumni (graduates and non-graduates) we had listed. 

The Rating Scale and Instructions —The Rating Scales and 
instructions were mailed to students and alumni alike under the 
accompanying covering letter. 


PRESIDENT’S OFFICE 
Purdue University 
Lafayette, Indiana 
February 28, 1949 
Dear Alumnus or Student: 


Purdue’s standing among educational institutions depends largely on 
the quality of the teaching that goes on in its classrooms and lecture halls. 
There are, however, many unexplored areas of the problem ‘“‘ What makes a 
good teacher?” Attempts to supply an answer have included asking 
present students to judge their teachers, but such attempts have frequently 
been attacked on the grounds that the undergraduate student lacks the 
maturity for supplying valid judgments of this sort. It is proposed to 
investigate this type of criticism more closely with the help of students and 
alumni. 

As students, will your answer to the question “‘What makes a good 
teacher?” change as you grow older? Or, as alumni, have your ideas as to 
what makes a good teacher changed since your student days? These are 
two aspects of the question we wish answered. To help us find the answer 
I am calling upon you to coéperate in an investigation being made by the 
Division of Educational Reference. Your task, as explained in the enclo- 
sures, is quire simple and brief. It includes rating your past or present 
teachers at Purdue University. I wish to urge you to complete it thought- 
fully and at your earliest convenience. 

I should like to assure you that results of the study will never be identi- 
fied with any specific instructor by any member of the instructional staff. 
The ratings will be seen only by one or two confidential clerks. Your 
ratings will therefore not affect any instructor’s standing. All the teachers 
listed on the attached form have individually agreed to participate in the 
investigation. 

It will be obvious to you, I am sure, that your evaluation of instructors 
should be entirely your own. It is most important, therefore, that you 
make these evaluations without discussing them with anyone. 

I shall be grateful for your coéperation in this study. 

Sincerely yours, 
Frederick L. Hovde 


President 
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The student or alumnus was first asked to examine the list of 
traits appearing on an enclosed copy of the Purdue Rating Scale 
for Instructors and rank the traits from 1 to 10 as to their 
importance to good college teaching. A brief questionnaire was 
then provided requesting sex, age, year of graduation, total 
number of semesters in attendance at the University, school or 
division, current semester (if a student) and veteran or non- 
veteran status. Next the student or alumnus was asked to 
scan the list of one hundred two instructors’ names and to check 
the names of those from whom he could recall having had courses 
for at least five weeks of a semester. He was instructed how to 
select at random no more than five of those instructors for rating. 
Rating of all five instructors was done in the traditional manner? 
for all ten traits, except that all five ratings were placed on the 
same scale for each trait with the instructor identified by a num- 
ber. The rating scale and questionnaire were then returned by 
mail. 

The Returns——Of one thousand questionnaires and Rating 
Scales sent to students and alumni, two hundred fifty-one stu- 
dents and one hundred thirty-eight alumni submitted ratings in 
time for the analysis. The students returned a total of eight 
hundred thirty ratings and the alumni a total of five hundred 
seventy-seven ratings. The composition of the student and 
alumni groups which sent in ratings can be seen in Table 2. 

From Table 2 we can surmise that the samples returning rat- 
ings of instructors are fairly similar with regard to sex and school 
or department. Moreover, the proportions of response by both 
students and alumni by schools and departments were approxi- 
mately similar to the proportions in the original mailing. The 
larger number of veterans in the alumni group is to be expected, 
as are, of course, the differences in age. The most striking differ- 
ence between students and alumni returning ratings, however, is 
found in the number of semesters completed. It will be recalled 
that alumni were selected from a listing (student directory of 
thirteen years ago) including names of those who would fall by 
the wayside, scholastically, or would leave school for other 
reasons. Yet, apparently, almost none of the drop-out alumni 
receiving questionnaires returned them. We must admit, there- 
fore, the possibility that the alumni group may be slightly 
superior academically, notwithstanding the fact that the student 
group is heavily loaded with juniors and seniors. 
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The number of ratings received for any individual instructor 
by students or by alumni does not exceed fifty. In a few cases 
no students rated a given instructor and in a few more cases no 


TABLE 2.—CHARACTERISTICS OF STUDENTS AND ALUMNI 
RETURNING RATINGS 


Students Alumni 


Males 80% 84% 
Females 20% 16% 
Mean Age 21.8 yrs. 33.0 yrs. 
Age Range 18-31 30-50 
School, Division or Department 

Trade & Industrial Education 1% 2% 

Engineering 22% 28% 

Science 43% 34% 

Pharmacy 5% 5% 

Home Economics 4% 5% 

Agriculture 22% 24% 

Physical Education 3% 2% 
Semester of Work Completed 

1 0% 

2 9% 

3 3% 

4 18% 

5 9% 1% 

6 22% 

7 11% 1% 

8 28% 2% 

Graduated 0% 97% 
Veteran, with Overseas Service Before Sep- 

tember, 1945 26% 38% 


alumni gave a given instructor ratings. Analysis of results to 
make comparisons between ratings of instructors and of traits by 
alumni and students have to be confined to ratings made as a 
whole on all instructors as a group. 


RESULTS 


Desiderata of Good Instructors—To compare the relative 
importance to alumni and to students of the ten traits on the 





i 








136 The Journal of Educational Psychology 


Purdue Rating Scale for Instructors, the median rankings of 
each trait by the two hundred fifty-one students and the one 
hundred thirty-eight alumni were obtained. The Pearson 
product-moment correlation coefficient between these medians 
for the ten traits was .92. On the basis of the obtained medians 
the rank order correlation was also computed to be .92, either 
method indicating a strong degree of relationship between rela- 
tive importance attached to the various rating traits by students 
and alumni. In Table 3 one can see that seven of ten traits have 
identical rankings by students and alumni. Trait 2, ‘Sympa- 
thetic Attitude Toward Students,’ is ranked fifth in importance 
by students and sixth by alumni. Trait 3, ‘Fairness in Grading,’ 
is ranked sixth by students and eighth by alumni. Trait 6, 
‘Sense of Proportion and Humor,’ shows the greatest discrepancy 
between students’ ranking and alumni ranking in that it is ranked 
eighth by students but fifth by alumni. Table 3 lists the traits 
in the order of importance by student ranking and gives the 
median rankings by students and alumni. 

The difference in emphasis placed on such traits as ‘ Fairness in 
Grading’ and ‘Sense of Proportion and Humor’ by alumni and 
students lends itself readily to speculation. Grades unquestion- 
ably assume their greatest importance to the individual when 
set as immediate goals but the alumnus, as his academic days 
recede, must gradually realize the over-emphasis placed upon 
grades in their relation to later success. As for ‘Sense of Pro- 
portion and Humor,’ the student is possibly too much concerned 
with his academic progress, has too much seriousness of intent to 
place much premium upon this trait. The alumnus, on the other 
hand, possibly has realized the importance of fun and humor to 
individual adjustment and has learned how to relax when relaxa- 
tion is needed. 

These trait rankings compare favorably, although not per- 
fectly, with a much earlier study in which another group of 
students ranked the traits.‘ In general these students showed a 
high degree of agreement in their judgments of the relative 
importance of the ten traits, with the exception of ‘Fairness in 
Grading.’ The highest three in order of relative importance 
were ‘Presentation of Subject Matter,’ ‘Stimulating Intellec- 
tual Curiosity,’ and ‘Interest in Subject.’ The lowest were 
‘Personal Peculiarities,’ ‘Personal Appearance’ and ‘Sense of 


Proportion and Humor.’ 
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Comparison of Student-Alumni Ratings of Instructors.—In 
obtaining a comparison between student and alumni ratings of 
the same instructors two types of analysis appeared desirable— 
that which would give a measure of the degree of relationship 
between the ratings and that which would give a measure of the 
amount and direction of any average difference that occurred 
for a given trait. We will first discuss the correlations obtained 
between student and alumni ratings of the same instructors for 
each trait. 


TABLE 3.—RANKING OF PRSI Traits By STUDENTS AND ALUMNI 


Student Alumni 
Rank Rank St Al 


Trait Order Order Mdn Mdn 
Presentation of Subject Matter l l 9 1.3 
Interest in Subject 2 2 2.1 2.0 
Stimulating Intellectual Curiosity 3 3 3.5 2.5 
Liberal and Progressive Attitude 4 4 4.1 4.7 
Sympathetic Attitude toward Stu- 
dent 5 6 4.6 5.3 
Fairness in Grading 6 8 4.8 6.1 
Self-Reliance and Confidence 7 7 5.6 5.8 
Sense of Proportion and Humor 8 5 6.0 5.0 
Personal Appearance 9 9 8.7 8.7 
Personal Peculiarities 10 10 8.7 8.8 


In obtaining correlations between student and alumni ratings 
for instructors it was necessary to correlate the sum of a given 
number of alumni ratings with the sum of the same number of 
student ratings for each instructor. This procedure was repeated 
for ten traits, giving in all ten correlation coefficients. Here the 
paucity of our data first became apparent. We had the choice of 
basing our correlations upon the sum of two student ratings 
against two alumni ratings for the same instructor (in which case 
we would have seventy-seven out of the one hundred two 
instructors represented in the comparison) or of basing the corre- 
lations upon the sum of eight student ratings against eight 
alumni ratings which would enable us to use only seventeen out 
of the one hundred two instructors. Although the number of 
cases of instructors entering the correlation would be only seven- 
teen, it was decided to use the latter method because of the 
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superior reliability of the variables (the sum of eight ratings 
instead of only two). In an early study it was found that ratings 
by from ten to twenty students on a single trait for instructors 
‘differing sensibly in the amount of the trait possessed yield 
reliabilities which compare favorably with the reliabilities 
reported for standardized mental and educational tests.”* In a 
recent investigation ratings based on the average of ten ratings, 
randomly selected, of two hundred five instructors, correlated 
against ten other ratings of the same instructors, randomly 
selected, had reliabilities ranging from .72 to .87.! 


TABLE 4.—ESTIMATION OF RELIABILITY OF PRSI Traits BAsEep 
ON THE CORRELATION OF E1Ggut Raters Versus EIGHT 
OTHER RATERS 


Reliability! Reliability 

Based on 10 X 10 Estimated for 

Trait No. Ratings 8 X 8 Ratings 
1 .86 .83 
2 .76 yf; 
3 .72 .67 
4 45 71 
5 .87 .84 
6 .78 .74 
7 .83 .80 
8 .82 .79 
9 .83 .80 
10 .84 .81 


For this study, reliabilities of the average of eight randomly 
selected ratings against eight other randomly selected ratings 
were estimated from the above study! by the Spearman-Brown 
Formula. These estimated reliability coefficients, shown in 
Table 4, vary from .67 to .84. This is evidence that the measures 
of student and alumni judgments that comprise the variables in 
our relationships are fairly reliable. Of course, to the extent that 
these measures are not reliable, the obtained relationships shown 
in Table 5 between student and alumni ratings are lower than 
they would be with a larger number of ratings per instructor. 

As can be seen in Table 5, the correlation coefficients range 
from .40 to .68, indicating a substantial correspondence between 
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student and alumni ratings of the same instructors. These 
measures indicate that to a large extent the same relative ratings 
are given to instructors by current students whose average age is 
about twenty-two years and by alumni with an average age of 
thirty-three years. 


TABLE 5.—COEFFICIENTS OF CORRELATION BETWEEN THE SUM 
oF E1gut STUDENT RATINGS AND THE SuM OF EIGHT ALUMNI 
RATINGS OF SEVENTEEN INSTRUCTORS FOR TEN PRSI 


TRAITS 
Traits r 
1. Interest in Subject .40 
2. Sympathetic Attitude toward Students .68 
3. Fairness in Grading 47 
4. Liberal and Progressive Attitude 59 
5. Presentation of Subject Matter .58 
6. Sense of Proportion and Humor .46 
7. Self-reliance and Confidence .64 
8. Personal Peculiarities .50 
9. Personal Appearance .65 
10. Stimulating Intellectual Curiosity .46 


Next it was desirable to investigate the actual differences that 
occurred in comparing ratings by students and alumni. For 
this analysis it was possible to use most of the ratings received, 
or all the ratings on ninety-two of the one hundred two instruc- 
tors on whom we had at least one rating by both students and 
alumni. The procedure for each of the ten traits was simply to 
compute, regardless of the number of raters, the mean alumni 
and student rating received by each instructor as representative 
of student or alumni judgment concerning that instructor. 
These mean ratings were then averaged for the ninety-two 
instructors and the average differences tested by computing the 
standard error of the differences between means, taking into 
account the correlation between the measures. Based on the 
data from this investigation, a significant difference between 
these averages for a given trait would then represent a true 
tendency for students or alumni to rate instructors higher or 
lower by a given amount. 

On three traits, ‘Interest in Subject,’ ‘Sense of Proportion 
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and Humor’ and ‘Self-Reliance and Confidence,’ students tend 
to rate instructors higher than do alumni. These differences of 
4 to 5 points are significant at the ninety-nine per cent level of 
confidence. On one trait, ‘Fairness in Grading,’ alumni tended 
to rate instructors about 3 points higher than did students. 
This difference was significant at the ninety-five per cent level of 
confidence. Complete results of this type of analysis are given 
in Table 6. In addition to the above significant differences 
which we have pointed out, there appears a slight tendency for 
students to give slightly higher ratings than alumni on most traits. 
Alumni are also considerably more variable in their ratings on 
Traits 1, 7 and 9. 

The tendency for students to rate instructors lower than do 
alumni on ‘Fairness in Grading’ may possibly reflect again the 
abnormal concern of undergraduates with grades and grading 
methods. For the rest of the traits, however, either there is no 
significant difference or the students tend to rate instructors 
slightly higher than do alumni. 

Teacher criticisms of the Purdue Rating Scale for Instructors 
made on grounds of student immaturity probably are founded 
upon the belief that undergraduates are not likely to give instruc- 
tors their due. Our results suggest otherwise; students of this 
study have tended to rate the instructors of this study higher 
than have alumni on two traits but not much differently than 
alumni on the other eight traits. On the other hand, superior 
ratings by students may reflect just those changes that have 
occurred in teaching during the past decade, leaving the distinct 
possibility that there actually are no differences in the way alumni 
and undergraduates rate the same instructors. In all fairness, 
however, we should consider the possibility that the change for 
the better in teaching ability of the ninety-two instructors was 
so great that these slightly superior student ratings actually are 
under-ratings of a superior teaching ability. 


SUMMARY AND CONCLUSIONS 


To test the hypothesis that ratings of instructors do not change 
with the maturity of the rater, comparisons were made of the rat- 
ings of ninety-two Purdue University instructors by one hundred 
thirty-eight alumni of at least ten years’ standing with ratings 
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by two hundred fifty-one current undergraduates. The relative 
importance of the ten traits of the Purdue Rating Scale for 
Instructors to alumni and students was also examined. The 
following tentative conclusions appear justified concerning the 
comparative ratings of Purdue University instructors by current 
students and alumni of ten to thirteen years ago: 

1) There are substantial positive relationships between the 
relative average ratings of instructors by students and alumni. 

2) The average differences that do occur between student and 
alumni ratings of Purdue University instructors indicate that 
present students as a group rate their instructors slightly higher 
than they will as older individuals, but that such differences are 
actually too small to be of practical significance. 

3) Purdue University students and alumni agree highly upon 
the relative importance to good instruction of the ten traits of the 
Purdue Rating Scale for Instructors. 

These conclusions hinge upon the satisfying of the general 
assumption that the selections of students, alumni and instruc- 
tors did not account for the results obtained. In particular, 
results in favor of student ratings over alumni ratings for the 
same instructors may reflect improvements in teaching traits of 
the ninety-two instructors over the period covered by the study. 

This study, then, has presented evidence that the judgments 
made by undergraduates of their instructors are valid ones in 
terms of permanence and maturity. Instructors should be able 
to avail themselves of the Purdue Rating Scale for Instructors 
with increased confidence that the judgments they receive 
thereby are worth-while ones in evaluating their own teaching 


personalities. 
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SOCIAL PERCEPTION AND TEACHER- 
PUPIL RELATIONSHIPS 


N. L. GAGE and GEORGE SUCI 


Bureau of Research and Service 
College of Education 
University of Illinois 


How two or more persons interact in a given situation depends 
on many variables, only a few of which have been isolated even 
conceptually. Properties of the persons and of their situations, 
all considered as interdependent aspects of a field of forces, obvi- 
ously provide handles by which to grasp the interaction phe- 
nomena. ‘This paper seeks to throw light on interactions between 
teachers and pupils as a function of one property of the teachers: 
the accuracy of their perception of pupils’ attitudes. 

We are concerned with the more affective dimensions of the 
interaction, rather than with cognitive ones. Our reasoning 
goes as follows: Behavior in any situation is a function of one’s 
perception of the situation. The ‘appropriateness’ of the 
behavior will hence depend, at least in part, on the correctness 
or accuracy of the perceptions. But mere perceptions do not 
govern behavior completely; the motivational system of the 
person will determine the ends to which his perceptions are used. 
Similarly, his ‘integration,’ or freedom from neuroticism, will 
determine how well he can control his behavior to accord with 
his motivations and perceptions. Thus, even if John perceives 
a stop light accurately, he will not stop for it unless he wants 
to and is sufficiently unexcited to be able to react in time. If 
John perceives his date’s boredom with a movie, he will not sug- 
gest that they leave unless he both wants to please her and this 
motive is integrated in a given way with his own feelings about 
the movie. 

So far, however, this formulation ignores the relationships 
between these determiners; motivation and perception, at least, 
are themselves linked together. As Bruner! recently summarized 
it: 


‘“‘(a) Personally relevant objects in the perceptual field, 
whether the relevance is a function of positive or negative 
value to the individual, undergo accentuation. . . . 

““(b) Perceptual selectivity . . . is shown to favor the 
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recognition of valued objects and objects associated with the 
prevailing interests of the subject.”’ 


Applied to teachers’ perceptions of pupils’ attitudes and their 
interaction with pupils, this implies that the accuracy of teach- 
ers’ perceptions will be a function of the way in which teachers 
value pupils and relationships with them. Miss Smith will tend 
to become aware of how her pupils feel about various issues to 
the degree that she is interested in those pupils and in getting 
along smoothly with them. Similarly, she will probably act 
upon her awareness of their attitudes in positive ways for the 
same reasons that motivated her to perceive them accurately. 
Thus certain patterns of motivation, perception and behavior 
tend to go together. At the one extreme of a continuum, we 
may have teachers who do not value pupils positively, do not 
perceive their attitudes accurately, and behave in ways con- 
ducive to negative reactions. At the other extreme are teachers 
who ‘need’ their pupils more, perceive them with greater sensi- 
tivity and understanding, and conduct themselves in ways that 
elicit positive affect from pupils. 

But, it may be objected, need does not necessarily lead to 
accuracy of perception. The more an object is valued, the more 
indeed may perception of it be distorted. In our framework, 
this leads us to examine what it is we are asking teachers to per- 
ceive. This, as will be described below, was the attitudes of 
pupils as reported by themselves in answer to direct Yes-No 
questions. Whether the attitudes so reported are ‘true’ or ‘real’ 
ones, we cannot know. It is reasonable to suppose, however, that 
what pupils report about themselves in a nonclinical, classroom 
situation are probably opinions of an ego-supporting kind. The 
opinions of the pupils tended, in other words, to be those which 
would enhance their own view of themselves. When teachers are 
then asked to estimate these opinions, they are being asked to 
judge what are the ego-supporting responses of pupils. Teachers 
who value pupils positively then should come closer to the pupils’ 
own picture of themselves. Such a ‘coming closer’ is what we 
have here termed ‘accuracy of social perception.’ 

In short, this should mean that teachers who perceive their 
pupils’ attitudes more accurately should be regarded more favor- 
ably by their pupils. To test this inference, we need estimates 
of (1) the accuracy of teachers’ perceptions of pupils’ attitudes, 
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(2) the favorableness of pupils’ attitudes toward their teachers, 
and (3) the relationship between (1) and (2). 


THE ESTIMATE OF ACCURACY OF SOCIAL PERCEPTION 


Ideally, we should deal with the teacher’s perception of atti- 
tudes of individual pupils on matters that are deeply relevant to 
each pupil. This would mean, first, the identification of attitude- 
objects that are highly salient for each pupil and, second, asking 
the teacher to estimate that pupil’s attitude toward those 
objects. The attitudes to be perceived would then differ from 
one pupil to the next. Such a procedure was manifestly too 
difficult for the present study. We compromised by selecting 
areas of attitude that we surmised to be significant to the pupils 
en masse, and we asked the teachers to estimate the group attitude 
on each topic. This procedure is admittedly gross; but what- 
ever support for our hypothesis might emerge from such an 
approximation might then be considered an underestimate of its 
true validity. 

Accordingly, we proceeded as follows: The twenty teachers of 
a high school were asked to estimate what per cent of the two 
hundred pupils in the school would respond affirmatively to a set 
of sixty-seven opinion items. The items dealt with scholastic, 
recreational, and student government issues in their school.* 
An illustrative item of each kind follows: 

Scholastic: “Should students with high scholastic ratings be 
allowed to skip classes?”’ 

Recreation: ‘‘Could the time spent in school activities be used 
better for studying or working?” 

Student government: ‘‘Should Student Council members be 
appointed by teachers rather than elected by students?” 

The students responded anonymously, underlining ‘ Yes’ or ‘ No’ 

for each item. The pupils’ ‘Yes’ responses were tabulated and 

the percentage of Yes’s found for each item. These percentages 

comprised the scoring key. Each teacher was scored by taking 

the difference between his estimate and the actual percentage 

for each item, summing, and, because a few teachers omitted 

some items, averaging over-all items. The signs of the differ- 

ences were disregarded. Such ‘mean error’ scores were also 

obtained separately for each of the three subgroups of items. 





* These items were adapted from those used by Wood.‘ 
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These ‘mean error’ scores are directly dependent on the abso- 
lute accuracy with which each percentage is predicted. But a 
teacher, although not able to predict accurately in terms of 
absolute percentages, may be able to rank the items in order of 
relative acquiescence of the pupils. Accordingly, we also com- 
puted for each teacher a Pearsonian r between actual and pre- 
dicted percentages. This we called an ‘r-score.’ It correlated 
.77 with the ‘mean error’ score, showing a possibly important 
difference between the two scoring methods. 


PUPILS’ INTERACTION WITH THEIR TEACHERS 


At a separate meeting, with the teacher absent, the pupils 
rated their six current teachers on fifty-two Yes-No-? items* 
typified by the following: 

“8. Is this teacher often ‘bossy?’”’ 

‘14. Does this teacher think he or she is always right and the 
student wrong?” 

Again the papers were unsigned and pupils were assured their 
responses would remain anonymous. A score for each teacher 
was obtained by scoring favorable and unfavorable pupil 
responses as 1 and 0, respectively, summing over-all items and 
averaging over-all pupils rating the given teacher. This measure 
we considered an approximation, adequate for this study, to the 
positive affect elicited in pupils by their interaction with a given 
teacher. 

Furthermore, for an appraisal of each teacher’s system of 
values in teacher-pupil relationships, we used the Teacher Attitude 
Inventory developed by Leeds and Cook.*f The two hundred 
thirty-nine items in this device had been empirically selected and 
weighted by Leeds and Cook for their ability to discriminate 
between teachers known to maintain very satisfactory and very 
unsatisfactory relations with pupils, in the judgment of the 
teachers’ local school administrators. 


RESULTS 


Our first task in analyzing the data was to estimate the gener- 
ality over items, or the internal consistency, of our approach to 





* These were a slight modification of those used by Leeds and Cook.’ 
t We are grateful to Professor W. W. Cook for permission to use this 
device. 
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accuracy to social perception. Table 1 shows the corrected odd- 
even coefficients for the ‘mean error’ scores on all items and on 
each of the three sub-groups. The coefficient for all items, .73, 
indicates that the accuracy of teachers in estimating percentages 
of acquiescence was fairly general over the items. Thus we have 
evidence that the task of judging opinion revealed individual 
differences among teachers that were substantially loaded with 
non-error variance. For the subgroups of items, these estimates 
yielded much lower coefficients. 


TABLE 1.—CoORRECTED Opp-EVEN RELIABILITY ESTIMATES FOR 
THE ‘MEAN ERROR’ SCORES ON OPINION ESTIMATES 








N = 20 
No. Odd Items /Even Items} Cor- 
Type of Items of rected 
Odd- 


Items Mean! SD | Mean} SD Even r 





All Items 67 | 20.19/2.81) 18.64 /2.97 .73 
1-67 
Scholastic 23 «| 22.50/5.03) 19.74/4.05 .17 
1-23 
Recreation 22 | 15.47 (3.13) 18.16 )4.28 .28 
24-45 


Student Government} 22 /| 20.50/5.07| 17.26/4.95 47 
46-67 























Secondly, what is the reliability of the pupils’ ratings of their 
teachers? Table 2 shows the results obtained from an appli- 
cation of Horst’s formula.? This coefficient, .93, indicates that 
the mean ratings we obtained would correlate very highly with 
subsequent sets of similar ratings, when our estimate of the 
variability of such further mean ratings is based on an average 
of the standard errors of those obtained. In short, the variance 
among the mean ratings is even more a non-error one; the ratings 
discriminated well among the teachers. 

Having established these estimates of reliability, we turned to 
the relationships with which we were primarily concerned. Is 
social perception, defined operationally as mean error in esti- 





Social Perceptions and Teacher-Pupil Relationships 149 


mating students’ percentages of acquiescence to opinion items, 
related to acceptance of the teachers by pupils, defined as the 
mean rating of teachers by pupils? Table 3 shows the pertinent 
coefficients of correlation. The r’s between mean rating and 
mean error for all items and for the three subgroups of items are 
—.37, —.16, —.30, and —.18, respectively. It is evident that 
all r’s are in the direction inferred from our hypothesis: the 
greater the teacher’s mean error in estimating student opinion, 
the lower his mean rating by students. None of these r’s are 
statistically significant. Our interpretation of this fact may, 
however, be tempered somewhat by the consistency in direction 
and size of the three subtest coefficients. These subtests provide 
in a sense three semi-independent trials of our investigation. 
The consistency of their results may therefore be construed as 
favorable evidence in addition to that provided by the over-all 
test’s r. 


TABLE 2.—EsTIMATE BY Horst’s METHOD OF THE RELIABILITY 
oF Pupits’ RatTincs oF THEIR TEACHERS 


og,” 
iz =i. Gs 
7 648.46 


I 
© 
oo 


where N = number of teachers 
n; = number of pupils rating teacher 7 


M,; = mean of the ratings for teacher 7 
o; = standard deviation of ratings for teacher 7 
om, = standard deviation of means for the N teachers 


When the teachers’ estimates of student opinion were scored in 
terms of their correlation with, rather than their difference from, 
the actual percentages, we obtain the results also shown in 
Table 3. Here the relationships, using r and z, as measures of 
accuracy of social perception, increase to .50 and .45, respectively. 
Both these coefficients meet the requirement for statistical 
significance at the five per cent level. 

Why do the r-scores correlate more highly than the mean error 
scores with the pupils’ ratings? We can only conjecture as 
follows: The r-scores are not influenced, as are the mean-error 
scores, by individual differences among teachers in ‘adaptation 
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level,’ or ‘anchor-point.’ The teachers differed very significantly, 
beyond the one per cent level, in the means of their predicted 
percentages. The r-scores eliminate the effect of these differ- 
ences. They render the measure of accuracy in social perception 
dependent solely on relative rather than absolute judgments of 
degree of acquiescence. 

Two further findings should be mentioned. First, as shown in 
Table 3, the scores on the Cook-Leeds inventory correlated .57 
and .46 with our mean error and r-scores, respectively. This 
indicates that the attitudes and understandings concerning pupils 
which are tapped by that test are significantly related to ability 
to estimate students’ opinions. But what of the relation of the 
Cook-Leeds instrument to mean ratings by pupils? Contrary to 
the Cook-Leeds findings, our coefficient was negative, —.20, 
although not significantly different from zero. In this school, 
at any rate, this test did not yield results in corroboration of the 
significant positive relationships with pupils’ ratings (about .45) 
previously reported by Cook and Leeds. 


DISCUSSION 


All our conclusions must be tempered by an awareness of the 
smallness of our sample and its lack of replication in other schools. 
For example, our impression that a curvilinear relationship exists 
between pupil ratings and accuracy of perception cannot ade- 
quately be pursued. It should be reported, however, that six 
teachers with high mean ratings nevertheless made only mediocre 
‘mean error’ or ‘r’ scores. None of the teachers with very low 
ratings made high accuracy in social perception scores. This 
could have occurred if some of the well-liked teachers did not 
attack the task of estimating pupils’ opinions with sufficient 
motivation. Thus low ratings seem sufficient to indicate poor 
perception scores, but the converse is not true. 

The instability of our results is further indicated when we omit 
one extreme case from the calculations of the r’s between the two 
scoring methods and the mean ratings. With the one case 
retained, r’s of —.37 and .50 were obtained with the ‘mean error’ 
score and the ‘r-score,’ respectively. With the extreme case 
omitted, the r’s became —.24 and .18, respectively—a consider- 
able reduction. Nevertheless some degree of correlation, in the 
expected direction, persisted. 
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SUMMARY 


To test the hypothesis that accuracy of social perception is 
positively related to effectiveness of interpersonal relations, we 
asked the twenty teachers of a high school to predict the per- 
centage of the two hundred students who would answer ‘ Yes’ 
to each of sixty-seven items eliciting opinions on various aspects 
of the school. Teachers’ predictions were scored in terms of 
(a) ‘mean error’ from actual percentages and (b) correlation 
with actual percentages. The teachers were rated by their pupils 
on a fifty-two-item inventory. Finally, the teachers filled out 
the Cook-Leeds Teacher Attitude Inventory. 

Results support the hypothesis. The teachers’ ‘mean error’ 
scores had fair internal consistency (corrected odd-even r, .73) 
and correlated —.37 with mean rating by pupils. Their ‘r-scores’ 
correlated .50 with mean rating. The teachers’ scores on the 
Teacher Attitude Inventory correlated .57 with the ‘mean error’ 
scores but, unexpectedly, —.20 with meanratings. (For N = 20, 
an r of .44 is significant at the five per cent level.) 

Our results must, of course, be pursued through replications in 
other schools and other situations. Tentatively, however, we 
conclude that teachers’ accuracy of social perception is posi- 
tively related to their effectiveness in eliciting positive affect in 
pupils. 
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THE READING EASE, HUMAN INTEREST VALUE, 
AND THEMATIC CONTENT OF ST. 
NICHOLAS MAGAZINE: A STUDY 
OF CHILDREN’S LITERATURE 


BARBARA SWARD and DALE B. HARRIS 
Institute of Child Welfare, University of Minnesota 


Perhaps the best known and most widely used technique for the 
analysis of written prose has been proposed by Flesch.?:** His 
formulae are extensively used by journalists, writers of articles 
for trade magazines, and others. Although many techniques 
for the analysis of educational material prepared for children 
have been developed, little work has been done on prose specifi- 
cally designed for children’s entertainment. 

Anyone professionally concerned with child development or 
education is familiar with the impression that stories and books 
for children have improved greatly in recent years. There is a 
widespread impression that children’s literature of the Victorian 
era was stilted, artificial, highly moralistic. The writers, inter- 
ested in this subject and in the adaptation to children which has 
presumably occurred in the informed adult world as a result of 
scientific study of children, here attempt an analysis of two 
problems. They are interested in the ‘readability’ of selected 
children’s material of an earlier day and of the present. They 
are also interested in the ideological content of this material, and 
how it may have changed over the years, due to changing con- 
ceptions both of children and of social values. 

For the first problem, the Flesch formulae suggest themselves 
as convenient indices of ‘Reading Ease’ and of ‘Human Interest.’ 
The formulae, their rationale and derivation are available and 
will not be discussed here. For the comparative purposes of 
this paper, two monthly children’s magazines were selected. 
St. Nicholas, the ‘grandfather’ in its field, commenced publica- 
tion in 1873 under the editorship of Mary Mapes Dodge. It 
enjoyed wide popularity until it ceased publication in 1940. In 
an earlier day, its pages were crowded with the work of such 
noted writers as Louisa May Alcott, Kipling, Twain, Bret Harte, 
Thomas Bailey Aldrich, William Cullen Bryant and others. 
Child Life, which, of the modern children’s magazines, has the 
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longest continuous publication and the widest circulation, com- 
. menced publication in 1922 and enjoys considerable repute today. 

To obtain materials for analysis, the writers proceeded as fol- 
lows: Samples of prose from both magazines were taken from 
issues published in successive decades. In order to utilize the 
first issue of St. Nicholas, the November, February, May, and 
August issues of the year 1873-4 were selected to yield the first 
sample. Comparable issues of succeeding decades gave addi- 
tional samples. The same months were used for the Child Life 
series, but, because a complete file was unavailable to the writers, 
the analysis began with the 1925-26 period. Thus, counting 
both magazines, a seventy-three year period is covered, 1873 to 
1946, inclusive. 

For each selected monthly issue of each magazine (twenty- 
eight issues for St. Nicholas and twelve for Child Life), the first, 
third, and fifth fictional stories provided the material for analysis. 
From each story, two one-hundred word samples were taken, one 
sample beginning with the third paragraph and one sample 
beginning with the fifth paragraph.* The first, third, and fifth 
stories were used consistently except where this procedure would 
include a succeeding installment of a serial story already sampled, 
or another story by an author already sampled; then, the story 
immediately following the designated story was used.f Few, if 
any, of the monthly issues contained more than five or six such 
fictional stories. 

In each case, a fictional story was considered to be a story that 
was pure fiction or fictionalized history in prose (many of the 
Washington and Lincoln stories for children are of this latter 
type), and of at least a page or more in length. Plays were not 
used because extensive dialogue would make the ‘personal 
sentences’ percentages in the Flesch formula spuriously high. 
Serial stories were used, but in counting paragraphs, the synopsis 
section was ignored as being abbreviated and very non-personal. 

Flesch’s Reading Ease (R.E.) score is easy to compute and 
quite objective. It is based on the average number of words per 





* We selected the third and fifth paragraphs in an effort to cover ade- 
quately two types of writing. The third paragraph frequently caught the 
introductory variation or exposition, while the fifth usually included more 
dialogue or light, personalized writing. 

+t We avoided using more than one story by the same author because we 
were interested in the variety of writers representing a particular period. 
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sentence, and the average number of syllables per one hundred 
words. The Human Interest (H.I.) score involves greater sub- 
jectivity of judgment. It involves the number of ‘personal 
words’ and the number of ‘personal sentences’ per sample of 
one hundred words. ‘Personal words’ are not always self- 
evident. The writers encountered many nouns which could not 
be said objectively to possess masculine or feminine gender, yet 
because of their functional use in the story it seemed wise to 
consider them such. Examples are: Lady Hollyhock, Buster 
Bumblebee, Kipling’s mongoose Rikki-Tikki-Tavi, etc. The 
rule adopted was that if non-human characters were definitely 
personalized and possessed gender, nouns designating them were 
counted as personal words. If they were neutrally or realistically 
portrayed, they were not so categorized.* 


TABLE I.—MEDIAN AND SEMI-INTERQUARTILE RANGE VALUES 
FOR READING EASE SCORE DISTRIBUTIONS FOR SELECTED YEARS 


Magazine 
and Year Median R.E. Score Q Flesch Description of Style 


St. Nicholas 


1873-4 79.4 5.3 Fairly easy 
1883-4 76.7 7.7 Fairly easy 
1893-4 80.7 9.4 Easy 
1903-4 80.9 6.2 Easy 
1913-4 80.5 5.6 Easy 
1923-4 75.5 6.0 Fairly easy 
1933-4 80.4 8.3 Easy 
Child Life 
1925-6 75.6 7.3 Fairly easy 
1935-6 84.6 4.7 Easy 
1945-6 78.4 6.3 Fairly easy 


The median R.E. scores for St. Nicholas and Child Life are 
presented in Table I. The descriptions of style in the table are 
borrowed from Flesch’s designations for adult material. 





* This rule, that in deciding ‘personal words’ the functional use of the 
word in the paragraph, not necessarily the word’s absolute or commonest 
use, was applied throughout. Flesch does not state this rule. The problem 
does not occur in the newspaper studies where the Flesch technique has been 
most widely used. It is a definite problem in imaginative material prepared 
for children. 








om™eom 42 6450" CO 





156 The Journal of Educational Psychology 


The outstanding result of this analysis is the fairly uniform 
level of reading ease of the material over three-quarters of a 
century. One might expect that the stress in recent decades on 
simplified, pictorialized literature for children might have brought 
with it a somewhat higher typical R.E. score derived from the 
prose. The great change in format of children’s material is 
apparent to all, but the early issues of St. Nicholas, with their 
few, black-and-white pictures, and small print, were just as 
‘easy’ in prose as the modern children’s magazines. 


TABLE I].— MEDIAN AND SEMI-INTERQUARTILE RANGE VALUES 
FOR HuMAN INTEREST SCORE DISTRIBUTIONS FOR SELECTED 
YEARS 

Magazine 
and Year Median R.E. Score Q Flesch Description of Style 


St. Nicholas 


1873-4 51.6 19.7 Highly interesting 
1883-4 71.9 26.8 Dramatic 
1893-4 72.4 27.9 Dramatic 
1903-4 55.9 19.7 Highly interesting 
1913-4 56.5 18.0 Highly interesting 
1923-4 64.2 18.7 Dramatic 
1933-4 64.7 15.3 Dramatic 
Child Life 
1925-6 57.6 19.9 Highly interesting 
1935-6 64.4 15.0 Dramatic 
1945-6 57.0 21.2 Highly interesting 


Human interest scores are presented in Table II. The appar- 
ently greater variability among these medians must be inter- 
preted in the light of the greater basic variability in the score 
itself. The greatest difference—that between the years of 
1873—4 and 1893-4—‘s significant only at the five per cent level, 
statistically; that is, the chances are still five in one hundred 
that a difference as large or larger than the observed one could 
occur from ‘errors’ of sampling. Again, the high interest value 
of children’s literature in another era must be noted. Small 
wonder that Henry Steele Commager’s anthology* from Si. 
Nicholas has been so well received. 





*Henry Steele Commager (Editor), The St. Nicholas Anthology. New 
York, Random House, 1948. According to the publishers, approximately 
fifty thousand copies of this anthology have been sold. 
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Two additional analyses are perhaps of sufficient interest to 
note the general results in passing without presenting tabular 
detail. The reading ease of children’s material results more 
from the selection of short, simple sentences. An analysis of 
mean sentence length reveals the interesting fact that, judged by 
sentence length alone, children’s prose rates ‘fairly easy’ to 
‘fairly difficult’ by Flesch’s norms. When mean number of 
syllables per one hundred words is calculated, the Flesch norms 
of ‘easy’ or ‘fairly easy’ are satisfied. Again, these indices are 
surprisingly constant over the seventy-three-year period studied. 

In the St. Nicholas material, the November and February 
issues, for all years, were consistently above May and August 
issues in Human Interest score. The relationship to Thanks- 
giving Day and to Valentine’s Day, Lincoln’s and Washington’s 
birthdays is apparent. Probably the December issues would 
exceed even the Thanksgiving issues in this regard, although this 
hunch was not investigated. 

Interesting as the results of word counts may be, such analyses 
miss completely the content of the material. ‘Content analysis’ 
has attracted the attention of journalists and social psychologists 
for some years. Of several techniques investigated, that pro- 
posed by Child and his associates! and applied by them to third- 
grade readers appeared to be most useful. The method consists 
in analyzing the behavior of principal characters in selected 
stories according both to the presumed motives of those char- 
acters and to the consequences of their behavior. 

The psychological theory of behavior which lies behind this 
method is obvious. It is that children learn motives from observ- 
ing the motives of persons they see or about whom they read. 
The assumption is made that the written page is an important 
source of conduct motivation in children, a position which has 
been and is implicit in a great deal of educational enterprise. 
Regardless of whether children’s conduct is measurably affected 
by such material, one may analyze the material to discern the 
kinds of values set before children three-fourths of a century ago 
and more recently. 

For this ‘content analysis,’ stories are selected according to the 
following criteria: (a) stories must contain action by animate 
characters, (b) they must not be essentially humor stories, for in 
such stories behavior and its consequences are frequently dis- 
torted for comic effect, and (c) they must not limit behavior 
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descriptions to purely objective and external accounts—evalua- 
tion of behavior and its consequences must be implied (or at least 
the possibility of such evaluation must be implied). Judged by 
these criteria, the stories of St. Nicholas are excellently suited to 
analysis. They are full of people to whom many things happen; 
consequences of behavior are very apparent; it is easy to reach 
an evaluation of the consequences as desirable or undesirable for 
the character involved. 

Stories which satisfy the above criteria are then analyzed in 
thema. A thema is a ‘sequence of psychological events,’ con- 
sisting of (1) a situation confronting a person or animate being, 
(2) the behavior, internal or external, with which the person 
responds, and (3) the consequences of the behavior as felt by the 
person or animate being himself. This third aspect of the thema, 
the present writers would restate as ‘consequences of the 
behavior as felt by the person himself or as inferred to him by the 
reader.’ In Child’s work with materials prepared for third-grade 
children, consequences were usually verbalized. St. Nicholas, 
designed for more mature children, sometimes presents conse- 
quences in very subtle, inferential ways, almost beyond stated 
feelings at times. 

Once thema are identified, they are classified in terms of an 
elaborate set of categories based by Child on Murray’s® personal- 
ity ‘needs.’ The writers used Child’s categories and, with occa- 
sional simplifications, his definitions. The categories appear in 
Table III. Complete definitions of the categories are eliminated 
in the interest of brevity, but some examples follow (adapted from 
Child): 

Activity —Thema in which activity is engaged in for its own 
sake, and the emphasis in the story is on the value of the activity 
itself. 

Achievement.—Incidents in which characters show either a 
generalized ambition to succeed or a desire for specific physical, 
intellectual, or social success. 

Affiliation —Thema including manifestations of friendliness 
and good will, and of the desire to do things in company with 
others. Often demonstrated through respect, deference, sym- 
pathy, generosity and helpfulness. 

Nurturance.—Thema in which helpfulness and willingness to 
exert oneself to aid any creature that may be in need of assistance. 
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TaBLE III].—Tuemas Founp IN Eacu BEHAVIORAL CATEGORY 
FOR EACH OF THE Four SAMPLED YEARS OF S7’. NICHOLAS 





















































1873-4 1893-4 1913-4 1933-4 
nied No. Per No. Per No. Per No. Per 
cent cent cent cent 
Objectless Behavior 
Activity 8 7.4, 6 3.6, 4 1.7) 14 5.6 
Passivity 1 .9| 2 1.2; 2 .8| 0 0.0 
Sentience 2 1.8} 1 .6| O 0.0| O 0.0 
Elation 2 1.8) 2 1.2, 3 1.3) 2 8 
Total 13 | 11.9; 11 6.6) 9 3.8) 16 6.4 
Behavior Primarily in Rela- 
tion to Things and Events 
Cognizance 1 .9 10 | 6.0) 28 | 11.7) 24) 9.7 
Achievement 16 | 14.7) 23 | 14.0) 37 | 15.5) 34 | 13.8 
Construction 1 9 2 1.2; 5 2.1; 2 .8 
Imaginality 0; 0.0 6; 3.6 0] 0.0} 1 4 
Acquisition 4; 3.7; 6] 3.6 2 £8 5) 2.0 
Retention 2 1.8) 1 .6 0 0.0) O 0.0 
Order 2 1.8) 1 6) 1 4 1 4 
Total 26 | 23.8 49 | 29.6] 73 | 30.5| 67 | 27.1 
Altruistic Social Behavior | 
Affiliation 10 9.2; 17 | 10.2) 61 | 25.5) 70 | 28.3 
Nurturance 16 | 14.7; 22 | 13.3) 12 5.0) 11 4.4 
Succorance 6 5.5) 6 3.6, 9 3.8) 7 2.8 
Deference 5 4.6) 11 6.6) 4 1.7; 2 8 
*Praying for others 1 9 61 .65 0; 0.0; 0; 0.0 
*Appreciation 2 1.8) 3 1.8) 2 .8| Of; 0.0 
Total 40 | 36.7; 60 | 36.1) 88 | 36.8) 90 | 36.3 
Egoistic Social Behavior 
Aggression 3} 2.8 14; 8.5) 18] 7.619 | 7.7 
Dominance 8 7.3] 7 4.2) 11 4.6) 10 4.0 
Recognition 0; 0.0) 1 .6| 2 .8 0; 0.0 
Autonomy 4; 3.7) 5] 3.0 3] 5.4 2 8 
Rejection 8; 7.44 7] 4.2) 11 4.6'19 | 7.7 
Total 23 | 21.2; 34 | 20.5) 45 | 23.0) 50 | 20.2 
Avoidance Behavior 
Harmavoidance 4 3.7; 8 4.8 7 2.9' 9 3.6 
Blamavoidance 2 1.8 4 2.4 4 1.7) 9 3.6 
Infavoidance 1 .9| O 0.0| 3 1.3) 7 2.8 
Total 7 6.4) 12 7.2) 14 5.9) 25 | 10.0 
Total of totals 109 {100.0166 100 .0)229 100.0248 {100.0 





* Categories added by the writers. 
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Aggression.—Thema featuring displays of temper and of fla- 
grant disobedience; verbal forms of aggression such as criticisms 
and arguments; dominating others by bullying and teasing; 
physical attack and killing. 

Harmavoidance.—Efforts to escape or avoid sources of danger. 

The writers added two categories to those supplied by Child, 
which appeared necessary for the most accurate placement of 
thema. These are starred in Table III. 

For the most part, the assignment of thema for categories was 
easy. When a single act might fulfill several purposes at the 
same time, and hence be assigned to multiple categories, the 
judgment was made in terms of the behavior which was pre- 
dominant or most conspicuous. When consequences of behavior 
were classified under the headings ‘reward,’ ‘punishment,’ ‘no 
consequence’ (neither reward nor punishment), the permanent, 
last, or long-run consequences were considered, not the immedi- 
ate consequences. 

The sample of stories selected for analysis was drawn at twenty- 
year intervals, beginning with the year 1873-4. The first two 
stories of at least a page in length in each month’s issue were 
used. There were in all ninety-six stories, twenty-four in each 
of four samples at intervals of twenty years. Serial stories were 
not included, and in only two instances was it necessary to include 
more than one story by the same author. 

Results are summarized in Tables III to V. The first of these, 
Table III, shows the number of thema represented in each year’s 
sample, and the proportions within each year represented in the 
several categories. The increase in gross number of thema 
observed in twenty-four stories reflects the increased length and 
especially the increased complexity of individual stories, which 
came somewhere around the turn of the century. Early issues 
contained many short stories. More recent decades have wit- 
nessed the introduction of mystery and adventure stories, where 
brother and sister hero-and-heroine combinations wander in and 
out of complex situations, exploring haunted houses, succoring 
destitute widows, and having exotic adventures in foreign lands. 
Yet the percentages vary little among the categories of thema, 
from sample to sample. 

When the classes of categories are examined, the percentages 
appear very stable from period to period. There has been a 
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decline in the amount of ‘objectless behavior’ (activity engaged 
in for its own sake, emphasis on emotion or feeling, etc.) and a 
corresponding increase in ‘behavior primarily in relation to 
things and events,’ but the difference between 1873 and 1933 falls 
short of the five per cent level of significance in both instances. 
Certainly, the emphasis on more concrete, activity-centered 
stories has not increased over the years to the extent we perhaps 
have believed. 

The uniformly low percentages of avoidance behavior in all 
samples are of interest. Story characters have not and are not 
represented as seeking in their behavior to avoid physical harm, 
parental or social disapproval, fears, embarrassment, self- 
consciousness or shyness. In general, thema are distributed as 
follows: one-fifth to egoistic social behavior, one-third plus to 
altruistic behavior, and one-fourth to behavior primarily related 
to things and events, especially the gaining of knowledge and the 
desire for success. This is true both for the Seventies of the last 
century and the Thirties of the present. 

The only differences which are statistically significant and 
which show a trend, occur within the category ‘altruistic social 
behavior.’ Here, there is a significant increase in behavior 
designated as ‘affiliation-manifestations of friendliness and good 
will and of the desire to do things in the company of others; often 
demonstrated through respect, deference, sympathy, generosity, 
and helpfulness.’’ This increase is paralleled by decreases in 
‘nurturance,’ ‘succorance,’ and ‘deference.’ When extremes are 
compared, the decline in nurturance satisfies the one per cent 
level of significance. Nurturance is defined as ‘‘ manifestations 
of helpfulness or willingness to exert oneself to aid any creature 
that might be in need of assistance.’’ Succorance refers to 
‘behavior directed at obtaining protection, sympathy, or assist- 
ance from another person,’”’ while deference indicates ‘‘ behavior 
toward other persons involving (a) compliance with requests or 
commands, (b) politeness, (c) subservience, (d) respect for au- 
thority, and (e) imitation of other persons.’ In the last two 
categories, the trend is consistent but the shift is not statistically 
significant, at least in terms of samples of this size. 

Table IV gives the number of thema in the more popular cate- 
gories, all years combined, with the proportions in which the 
actor is rewarded, punished, or experiences no consequences of 
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TaBLE IV.—CONSEQUENCES OF BEHAVIOR IN SELECTED CATE- 


Goriges OF THeMas, S7’. NICHOLAS Stories 











Percentage in Which 
Behavior Was: 
No. of 
Category cen No 
Re- Pun- Conse- 
warded| ished : 
quences 
Objectless Behavior 
Activity 32 .35 .25 .39 
Behavior Primarily in Rela- 
lation to Things and Events 
Cognizance 60 .82 .05 13 
Achievement 109 .72 22 .06 
Construction 10 .50 13 oF 
Acquisition 17 .45 .33 22 
Altruistic Social Behavior 
Affiliation 158 .79 .03 .18 
Nurturance 61 71 .07 .22 
Succorance 29 .75 .18 .07 
Deference 22 71 10 .19 
Egoistic Social Behavior 
Aggression 54 44 31 .25 
Dominance 35 .39 .17 .44 
Autonomy 24 .32 .33 .39 
Rejection 45 .08 11 .81 
Avoidance Behavior 
Harmavoidance 27 .61 12 .27 
Blamavoidance 19 15 .28 .57 
Infavoidance 11 .21 .48 31 

















his behavior. In general, altruistic social behavior is rewarded 
Egoistic social behavior is much 


less frequently rewarded; it is somewhat more likely to result in 
no consequences than to be punished. 


about three-fourths of the time. 


In behavior primarily 


related to things and events, the seeking of knowledge (cog- 
nizance) is rewarded in the great majority of instances; behavior 
denoting the desire for material possessions (acquisition), or the 
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creation of a material or intellectual product for utilitarian or 
esthetic use (construction) is rewarded about half the time. 

When the thema, distributed in the various categories by year 
samples, are broken down according to type of consequences, the 
number in any category becomes so small as to be relatively 
meaningless. The material, therefore, is not reproduced here. 
The totals, however, showed that for all categories combined 
there has been relatively little shift over the years in the tendency 
to reward or punish the consequences of behavior in stories. 
Consistently in samples taken at twenty-year intervals, behavior 
is rewarded in about three of five thema; it is punished in about 
one of six thema. For the remainder of the instances, no conse- 
quences follow from the behavior. An examination of the 
several categories of thema, such as objectless behavior, altruistic 
social behavior, egoistic social behavior, etc., shows that there 
has been no trend over the years to reward or to punish such 
classes of behavior in greater or less amount. 


TABLE V.—CHARACTERISTICS PORTRAYED IN 24 STORIES SELECTED 
IN Eacu oF Four Years, ST’. NICHOLAS MaGaziIne 














1873-4 1893-4 1913-4 1933-4 

N Per N Per N Per N Per 

cent cent cent cent 
Child 35 52.2 | 31 32.0 | 41 31.8 | 29 21.6 
Adult 23 34.3 | 39 40.2 | 75 58.1 | 82 61.2 
Animal 4 6.0 | 16 16.5 5 3.9} ll 8.2 
Fairy 0 0.0 5 5.2 3 2.3 2 1.5 
Groups of the above| 5 7.5 | 6 6.2; 5 3.9 | 10 7.5 
Total 67 | 100.0 | 97 | 100.1 |129 | 100.0 {134 | 100.0 





























Table V contains some additional data of interest. The total 
number of characters portrayed, per sample of twenty-four 
stories, has increased steadily. This is in keeping with the 
observation made earlier, that stories are longer and more com- 
plex now-a-days. The proportion of child characters has been 
cut in half over the years covered by this survey, and the pro- 
portion of adult characters has almost doubled.* 





* These figures may, in part, reflect an increasing problem which the 
writers faced—that of determining the age of child characters. We took 
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It should be clear to the reader that this content survey covers 
only one source of children’s literature, and a top-grade one at 
that. Yet, one impression stands out. Despite two world wars, 
an industrial revolution which has altered almost beyond belief 
the material comforts and conveniences of living, very marked 
alterations in habits of family living, major shifts in our economic 
ideologies, and tremendous advances in the format of children’s 
reading materials themselves, the content of this sample of 
children’s literature has altered very little indeed. To be sure, 
in this study we have established no absolute norm against which 
to measure change. Possibly, another set of content categories 
would give more evidence of change. Yet, the categories used 
appeared adequate to the writers; none disappeared, and no new 
ones were required for analysis of more recent materials. 

The writers have carried from their study one or two impres- 
sions which cannot be substantiated from the tables, but which 
they feel inclined to report anyway. As stories become longer 
and more complex, there occurred a subtle shift in how child 
characters were portrayed. Children in stories of an earlier 
day were recognized as immature. Their interests, activities, 
preoccupations were not those of adults. They met failure and 
success, they were rewarded or punished, but as children, not as 
‘little adults.’ Stories of a later era not only included fewer 
child characters but presented children more frequently as solv- 
ing problems which baffled adults. While the industrious child 
of an earlier era was likely to win out over economic odds eventu- 
ally, the boy detective of a later era was just as likely to solve a 
baffling mystery. The child character of an earlier day won by 
using materials at his disposal; the child character of a later 
period won by using, many times, the equipment and materials 
ordinarily available only to adults. 

It has often been said that one major change in the thinking of 
adults in the last one hundred years has been to accord to children 
a life of their own, appropriate to recognized developmental 
levels. In one sense, however, the children’s literature of an 
earlier day achieved this recognition somewhat more successfully 
than children’s literature of a more modern time. 





age sixteen as a division point, and when ages were not indicated, we made 
the best judgment possible in terms of the content. The trend is too 
marked, however, to be accounted for in terms of unreliability. 
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The writers would like to suggest that somewhat different 
ideological dimensions, developed around the hypothesis just 
advanced, would be worth studying in samples of children’s liter- 
ature. Religious stories and pamphlets for children should be 
included, likewise. Possibly, here evidence of greater adaptation 
might befound. As far as St. Nicholas magazine is concerned, it 
was surprisingly modern, even in its early days. Whether that 
be an accolade to St. Nicholas’ early issues, or a condemnation 
of its later years, only further surveys of other materials for 
children can tell. 
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CORRECTIVE READING: A MEANS OF 
INCREASING SCHOLASTIC ATTAIN- 
MENT AT THE COLLEGE LEVEL 


DOROTHY J. McGINNIS 


Associate Director, Psycho-Educational Clinic 
Western Michigan College 


The purpose of this study is to describe and evaluate materials 
and procedures employed in doing corrective work in reading at 
the college level. Appraisal is made not only in terms of differ- 
ences in initial and final scores on a reading test, but in point- 
hour-ratios as well. In making this evaluation one control and 
one experimental group have been equated, resulting group data 
have been compared statistically, a brief history of an individual 
in the Reading Laboratory has been presented, and tentative 
inferences have been made. 


DESCRIPTION OF MATERIALS 


(1) The primary source of materials used in the Reading 
Laboratory at Western Michigan College is the Reading Manual 
and Workbook.! This book contains twenty-two specific assign- 
ments, each of which has been designed to aid the student in 
developing a definite reading ability and skill. The reading 
assignments are related to actual classroom situations and the 
student is given an opportunity to apply the suggestions as he 
does his daily class preparation. Reading and studying are con- 
sidered together. Assignments show how: 

(a) To discover the nature and extent of the student’s reading 
difficulties. (b) To aid in interpreting reading difficulties and 
in selecting the assignments upon which the student is to work. 
(c) To investigate the possibility of a visual defect which would 
interfere with reading performance. (d) To improve the stu- 
dent’s ability to use the different parts of abook. (e) Todevelop 
skill in reading for a particular point and for general ideas. (f) 
To develop ability and skill in outlining. (g) To develop skill 
in reading a chapter effectively and well. (h) To develop skill 
in associating facts obtained from a textbook with facts already 
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known. (i) To learn how to concentrate upon a reading activ- 
ity. (j) To increase rate of reading without interfering with 
comprehension. (k) To develop skill in the use of the dictionary. 
(1) To increase reading vocabulary and to develop adequate 
skills in word study. (m) To develop skill in interpreting a 
short poem. (n) To develop skill in reading the newspaper 
effectively and well. (0) To develop ability and skill in problem- 
solving, especially in such areas as mathematics, chemistry, or 
physics. (p) To develop ability to read a table or graph. (q) 
To develop ability to read a map. (r) To develop skill in the 
use of the card catalogue. (s) To develop ability in using the 
Readers’ Guide. (t) To develop skill in applying what is read 
to the solution of one’s personal problems. (u) To develop 
skill in reading selectively and in preparing for examinations. 
(v) To appraise the change of reading skills and abilities acquired 
during the time between the initial and the final testing. 

(2) Orientation Inventory.—This inventory, which is a part of 
Reading Manual and Workbook, has been designed to help the stu- 
dent recognize and pick out the reading abilities and skills which 
he does not possess and at the same time it furnishes information 
concerning his reading background and certain physical and 
emotional factors which may affect his reading performance. 

(3) Equivalent forms of the Iowa Silent Reading Test* have 
been used. ‘‘The test is designed to cover a wide range of the 
skills known to be indispensable to effective reading of the work- 
study type. The test measures three broad general areas of 
silent reading abilities; namely, (a) Rate of Reading at a Con- 
trolled Level of Comprehension, (b) Comprehension of Words, 
Poetry, Sentences, Paragraphs, and Longer Articles, and (c) 
Ability to Use Skills Required in Locating Information.”’ 

(4) Films for the Improvement of Reading.t—These sixteen 
films, consisting of selections taken from a national journal, are 
intended to make each student read as swiftly as he can com- 
prehend and no faster. This is done by projecting on the screen 
a printed page of film on which a broad band of light slides 
smoothly from left to right and from line to line. The student 
is not forced to stay within the band of more brightly lighted 
words but he is persuaded to attempt to follow this pacer. 
Comprehension check questions and transfer selections accom- 
pany the films. 
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PROCEDURE FOLLOWED IN THE READING LABORATORY 


Persons who enter the Reading Laboratory do so because of 
their own initiative or as a result of advisement. No one is 
compelled to enroll. The greater proportion of individuals begin 
their work during the first week of the semester. Students, how- 
ever, are permitted to enter any time if there is room in one of 
the sections. Each section is limited to twelve. 

At the first meeting the members of the group are given the 
Orientation Inventory to complete. Next they are shown how 
to score this inventory and how to prepare a profile sheet which 
is of value to both the student and the instructor in interpreting 
factors underlying the reading difficulty. 

During the next two meetings of the group Form Am of the 
Iowa Silent Reading Test is administered. The administration 
of this test requires approximately ten minutes of the third 
laboratory period. The remainder of the hour is spent in 
acquainting students with what has been accomplished by other 
people in the Reading Laboratory and an attempt is made to 
have them realize that they can improve their reading ability 
and at the same time facilitate their preparation for their regular 
college classes. The responsibility of learning to read is placed 
upon the student. 

During the fourth meeting of the group the results of the Iowa 
Silent Reading Test and the Profile Sheet for Orientation Inven- 
tory are interpreted and each person is given an opportunity to 
ask questions concerning his performance. Class discussion is 
encouraged at this time. If physical or emotional abnormalities 
are indicated by the Orientation Inventory, the services of the 
Health Service or Psycho-Educational Clinic are requested. 

The student is next asked to indicate on the Master Work Sheet 
the assignments upon which he needs to work in order to improve 
his reading ability. This Master Work Sheet is based upon the 
results of the Iowa Silent Reading Test and the Orientation 
Inventory. 

Each person then completes Assignment III at which time he 
and the laboratory clinician investigate visual defects which may 
interfere with reading ability. If significant visual difficulties 
are found, he is referred to an ophthalmologist. 

From this point on each individual is permitted to work upon 
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the assignments indicated on his Master Work Sheet. He is 
‘on his own.’ Seldom, if ever, do any two students have the 
same pattern of assignments. Each person is allowed to progress 
from assignment to assignment at his own rate and in accordance 
with hisown plan. Some may spend days and even weeks upon one 
assignment while others may require much less time. The stu- 
dent, with the aid of the laboratory clinician, determines when he is 
ready for the next assignment indicated on his Master Work Sheet. 

Individuals in the Reading Laboratory are urged to apply the 
suggestions made in the laboratory to their outside study. Many 
apply the various assignments in their class preparation of history 
and English, while others use their textbooks in biology and 
chemistry as a means of increasing their reading abilities and 
skills. Adjustment of materials to the reading abilities of the 
students is essential and frequently becomes a real problem for the 
instructor. 

When the enrollee and the instructor agree that the purpose for 
which the student has entered the Laboratory has been accom- 
plished, Form Bm of the Iowa Silent Reading Test is adminis- 
tered. Changes between initial and final test scores are inter- 
preted and discussed. Many individuals continue two and even 
three semesters in the Laboratory. 


PROCEDURE USED IN MAKING THE EVALUATION 


1) Twenty college students enrolled in the Reading Laboratory 
were selected to make up the experimental group. 

2) The Iowa Silent Reading Test, Form Am, was administered. 

3) After a period of sixteen weeks an equivalent form of the 
Iowa Silent Reading Test was given to the twenty individuals 
enrolled in the Reading Laboratory. 

4) Data resulting from the administration of the Iowa Silent 
Reading Test were tabulated and treated statistically. 

5) One control and one experimental group were equated. 
In equating the groups the individuals were matched according to 
sex, status in college, scholastic aptitude and ability in reading. 
The control group consists of twenty persons selected from the 
student body at large who have not had work in the Reading 
Laboratory, whereas the experimental group consists of twenty 
individuals who have been actively engaged in the work of the 
Reading Laboratory. Each of the two groups consists of fifteen 
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men and five women. Eleven individuals making up each group 
are freshmen, five are sophomores, two are juniors and two are 
seniors. For both groups the median score on the Ohio State 
Psychological Examination is 49.5 and on Part III of the same 
measure 59.5. The mean number of hours carried by each group 
was 15.2. Inspection of courses suggest that reading require- 
ments were approximately equal. 

6) The point-hour-ratios for one semester of the control and 
the experimental groups were tabulated and treated statistically. 

7) Data resulting from this study have been summarized and 
inferences have been tentatively set forth. 

8) In this study no attempt has been made to determine the 
permanency of gain in reading ability, whether or not there is an 
increased achievement in other language skills, and whether or 
not gain in reading achievement is accompanied by constructive 
personality changes. Future investigations could well be 
directed to an appraisal of these factors. 


TREATMENT OF DATA 


Data resulting from this study have been tabulated as shown 
in Tables 1 and 2. In order to determine the significance of the 
difference between the means obtained in the initial and final 
testing, the single group method of obtaining the standard error 
of the difference between correlated means of small samples was 
used and ‘t’ was found. In determining whether or not differ- 
ences between the experimental and control groups were sig- 
nificant, the standard error of the difference between two uncor- 
related means of small samples was found and ‘?’ was calculated.’ 


BRIEF HISTORY OF AN INDIVIDUAL WORKING 
IN THE READING LABORATORY 


Kenneth, who is eighteen years old, was graduated with a poor 
academic record from a large high school. Inability in reading 
was reported to be the chief cause of his poor school adjustment. 
However, corrective or remedial treatment in reading was not 
provided. After being refused permission to enter two mid- 
western colleges because of poor high-school grades, Kenneth 
sought the aid of the Psycho-Educational Clinic. The Wechsler- 
Bellevue Intelligence Scale revealed intelligence quotients of 
119, 116, and 121 on the verbal, performance and full scales, 
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respectively. Performance on the Iowa Silent Reading Test, 
Form Am, was equivalent to that of an individual eight months 
in the fifth grade (5.8). His percentiles on the Ohio State 
Psychological Examination and on the reading section of the 
same test were 23 and 20, respectively. The clinic suggested 
that Kenneth enter the Reading Laboratory at Western Michigan 
College where he also was permitted to enroll for fifteen hours of 
academic work. Assignments completed in the laboratory were 
those designed to develop skill in chapter reading, concentration, 
skimming, word study, use of the dictionary and skill in selecting 
essential facts for review. At the end of one semester his grade 
score on Form Bm of the Iowa Silent Reading Test was that of 
an individual five months in the tenth grade (10.5). This gain 
is statistically significant. In his academic work Kenneth 
earned three B’s and three C’s, which is equivalent to a point- 
hour-ratio of 1.4. He has enrolled for the second semester in the 
Reading Laboratory. 


TABLE 1.—INITIAL AND Finau Iowa SILENT READING TEST 
Scores OF ALL STUDENTS IN THE EXPERIMENTAL GROUP 


Median Standard Score Initial Final 


200-204 
195-199 
190-194 
185-189 
180-184 
175-179 
170-174 
165-169 
160-164 
155-159 
150-154 
145-149 
140-144 1 


bt or 
Ww o 


Noe Whe Ww 


oO We bo 


Total 20 20 
Mean 169.5 185.25 
Sigma 14.2 13.95 
Standard error of the difference of the two means is 1.12, 
‘t? = 14.06. 
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SUMMARY OF DATA 


The distribution of initial and final scores of students enrolled 
in the Reading Laboratory on the Iowa Silent Reading Test is 
shown in Table 1. The correlation between equivalent forms 
was .94 and the degree of variability on both tests as indicated 
by sigmas of each distribution remained relatively constant. 

In Table 2 a distribution is shown of the point-hour-ratios of 
students who have been enrolled in the Reading Laboratory and 
an equivalent group of individuals who have not done corrective 
work in reading. Variability as determined by the sigmas of 
each distribution is comparatively constant. 


TABLE 2.—Point-Hour-Ratios oF STUDENTS IN CONTROL AND 
EXPERIMENTAL GROUPS 


Point-Hour-Ratio Control Experimental 
3.00 1 
2.75-2.99 
2.50-2.74 1 
2.25-2.49 2 
2.00-2.24 2 3 
1.75-1.99 2 3 
1.50-1.74 3 2 
1.25-1.49 3 2 
1.00-1.24 3 4 

.75-— .99 2 
.50-— .74 3 

.25- .49 3 

.00-— .24 1 
Total 20 20 
Mean 1.17 1.73 
Sigma .61 .605 


Standard error of the difference fof the two means is .196, 
‘t? = 2.86. 


Tables 1 and 2 show that persons enrolled in the Reading 
Laboratory made not only a statistically significant gain in read- 
ing ability as measured by initial and final tests, but also sta- 
tistically higher point-hour-ratios than an equivalent group of 
students who were not enrolled in the laboratory. In both 
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tables the differences between the means are significant at the 
one per cent level. 
INFERENCES 


1) The data of this study show quite conclusively that students 
who enrolled in the Reading Laboratory not only made a marked 
and statistically significant gain in reading as measured by an 
objective test, but also made statistically significant higher 
point-hour-ratios than an equivalent group of students who did 
not do corrective work in reading. 

2) In consideration of the facts presented in this study it is 
reasonable to assume that the materials and procedures employed 
in the Reading Laboratory are of value in improving not only the 
reading ability of college students, but their point-hour-ratios as 
well. This, in part, may be due to the nature of the materials 
used in doing corrective work. The assignments have been 
directed primarily to the accomplishment of academic attain- 
ment and not merely to an increase in reading rate. 

3) It is reasonable to assume that mechanical instruments and 
techniques used to facilitate adequate eye movement are useful 
adjuncts to the development of reading and study skills but that 
they cannot be regarded as sine qua non. 

4) Motivation is a factor to be considered in the interpretation 
of data presented in this study for each student entered the labor- 
atory voluntarily and not as a requirement. 

5) In interpreting data resulting from this evaluation consider- 
ation should be given to the possibility that some gain in achieve- 
ment may be due to the fact that students in the Reading Labora- 
tory have been singled out and given individual consideration and 
attention. 
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AWARENESS OF VOCABULARY SIZE: ITS 
RELATION TO CLASS STANDING AND 
SEX DIFFERENCES* 


JANICE LEVIN and CHALMERS L. STACEY 


Syracuse University 


INTRODUCTION 


To discover the extent of the difference between the amount 
of knowledge that we credit to ourselves and the amount we actu- 
ally possess is to discover our academic limitations and, not 
infrequently, the fact that we know more than we think we do in 
some respects and less in others. 

Any college class taking an examination can be divided into 
three groups on the basis of expected and obtained grade on that 
test. One group will be made up of students whose expected 
grades approximate their obtained grades. The other two 
groups will be composed of students whose obtained grades are, 
respectively, higher and lower than expected. Is there any rela- 
tionship between the students’ standing in the course and the 
deviation between expected and obtained grades? Does the 
better student possess greater awareness of the degree of his 
mastery of the essentials of the course than the poorer student? 
How does the average student compare in this respect with the 
twoextremes? Dosex differences play a réle in such estimations? 

That personality factors play an important part in the estima- 
tion of one’s future performance has been shown in the many 
investigations of level of aspiration behavior. These factors 
have, of necessity, been neglected in this study. 

Bear and Odbert’s! study of the ‘insight’ displayed by college 
students in evaluating their knowledge of word meanings led 
them to conclude that ‘the average student’s statement is a very 
uncertain guide to his performance on a test.’”’ Their two 
hundred eighty-two subjects indicated the words that they did 
not know in the material given them and were administered 
shortly afterward the vocabulary test from which the words were 
taken. The subjects missed sixty-four per cent of the words they 





* Presented at the meeting of Upper New York Psychologists at Syracuse 
University on November 5, 1949. 
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said they did not know and forty-four per cent of the words they 
claimed to know. 

It is the aim of this paper to study awareness of vocabulary size 
in relation to class standing and sex differences. 


DESIGN OF THE EXPERIMENT 


The subjects of this study were one hundred forty-three 
sophomore students, sixty-seven males and seventy-six females, 
enrolled in an introductory psychology course. In respect to 
the study of class standing, three groups were formed on the 
basis of the sum of the raw scores on four one-hour examinations 
during the semester: lower twenty-five per cent, thirty-six sub- 
jects; middle fifty per cent, seventy-one subjects; upper twenty- 
five per cent, thirty-six subjects. In the testing which followed, 
these groups were compared with each other and with the entire 
group in respect to the measures being considered. With the 
exception of a female student in the lower twenty-five per cent 
group, all subjects took the tests relevant to this study. In the 
course of the study of sex differences, the male, female, and entire 
groups were compared with each other on the measures under 
consideration. 

Development of the key.—Familiarity with words cannot justi- 
fiably be discussed in terms of a dichotomous relationship existing 
between knowing and not knowing the meanings of words. The 
continuum underlying word familiarity ranges from absolute 
unfamiliarity with the word to absolute certainty of its meaning. 
When subjects are forced to place words into ‘know’ and ‘don’t 
know’ categories, there is no way of determining how familiar 
they are with the words they say they know or how unfamiliar 
with the words they claim not to know. 

The familiarity of some words is beyond question. Some are 
more familiar in certain contexts than when by themselves or in 
other contexts. There are some words that we recognize and 
perhaps even use yet whose meanings are vague. Of the words 
whose meanings we do not know, there are some that we recog- 
nize and some that we cannot recall having seen or heard. These 
varying degrees of familiarity and strangeness of words, as dis- 
cussed by Bear and Odbert,! served as the basis for the key which 
was used by the subjects for the classification of the words and 
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phrases presented to them. The five items of the key were 
phrased as follows: 


1) I know this word (or phrase). I know what it means no matter 


where it may appear. 
2) This word (or phrase) is familiar to me. If it were used in a sen- 


tence, I would know what it means. 
3) I recognize this word (or phrase). I am not sure that its use in a 
sentence would help me to know what it means. 
4) I recognize this word (or phrase). I have no idea of its meaning. 
5) I have never seen or heard this word (or phrase). 


A Vocabulary Appraisal Sheet—The Michigan Vocabulary 
Profile Test served as the basis for the preliminary data sheet 
which was called ‘A Vocabulary Appraisal Sheet.’ This vocabu- 
lary test has eight subtests covering the fields of study and 
interest of students. In each of the subtests are thirty items 
arranged in order of increasing difficulty. The items either define 
or describe one of four words or phrases given. 

The last fifteen correct answers in seven subtests (the sports 
test was excluded) were listed on the appraisal sheet. Opposite 
each word or phrase were five boxes each of which represented 
one item of the key. The subjects were directed to familiarize 
themselves with the key items and the sample appraisal printed 
on the sheet and to express how well they felt they knew each 
word or phrase listed by placing an ‘X’ in the appropriate box 
next to each item. 

The vocabulary test was administered four weeks after the sub- 
jects had filled out the appraisal sheet. Raw scores on each 
of the subtests and a composite raw score were obtained for each 
subject. 

Combining key items.—On inspection of the item statements of 
the key, it will be noted that categories 1 and 2 indicate positive 
familiarity with the word; that is, within certain limits, the sub- 
ject would be expected to know those words placed in these 
categories. Category 3 implies that the subject is uncertain of 
his ability to define the word and may be considered an ‘uncer- 
tain’ category. Items 4 and 5 are statements of negative 
familiarity: one would expect that the subject would define 
correctly only that proportion of words placed in these categories 
that can be predicted on a chance basis. In view of the combina- 
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tive possibilities presented by the five-item key, three new cate- 
gories were evolved, the positive, uncertain and negative. 

The estimate scores are concrete demonstrations of awareness 
of vocabulary size. They represent the difference between 
estimation and performance. The positive estimate score is the 
difference between the number of words of those appraised which 
the subject claims to know and the number of words defined 
correctly on being tested. The negative estimate score is the 
number of words the subject says he does not know minus the 
number he fails to define on being tested. By definition, aware- 
ness of vocabulary size is a function of the estimate scores. 

The positive estimate score was obtained by subtracting from 
the total number of words placed by the subject in categories 1 
and 2, the total number defined correctly. A positive value 
of this measure indicates overestimation on the part of the 
subject of his ability to define the words presented. The nega- 
tive estimate score is the sum of words placed in categories 4 and 
5 minus the number missed. A positive value indicates that the 
subject has overestimated the number of words he would miss, 
or underestimated the size of his vocabulary in respect to these 
words. Positive and negative estimate scores were computed 
for each subject and averaged to yield group values. 


Stupy I: AWARENESS OF VOCABULARY SIZE AND CLASS STANDING 
AIMS OF THIS STUDY 


Defining awareness of vocabulary size as a function of the 
deviation between what the subject says he knows and what he 
knows on being tested, we may say that the subject who possesses 
greater awareness of his vocabulary size will define correctly a 
greater percentage of the words he claims to know than the sub- 
ject who is a poor judge. By the same token, one would expect 
him to overestimate and underestimate the number of words he 
knows or does not know to a smaller degree than the poorer judge. 

With a clearer understanding of the tools and techniques used, 
it is now possible to state the null hypotheses, the testing of 
which may be considered a restatement of the aims of this study. 

1) There is no difference between the three subgroups and 
entire group in respect to the mean percentage of correctly defined 
categorized words. 

2) There is no difference between the subgroups and entire 
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group in tendency to overestimate or underestimate vocabulary 


size. 
3) The correlations between the total raw scores on the vocabu- 


lary test and the semester raw scores for the subgroups and entire 
group are zero. 
RESULTS AND DISCUSSION 


In Table la are presented the mean percentages of correctly de- 
fined words placed in the categories of the five-item key by the sub- 


TABLE 14.—MEAN PERCENTAGES OF CORRECT DEFINITIONS 
oF Worps PLACED IN EACH OF THE CATEGORIES OF THE 
FIVE-ITEM KEry BY THE SUBGROUPS AND ENTIRE GROUP 


Category Lower 25% Middle 50% Upper 25% Entire group 


1 57.2 59.8 67.6 61.5 
2 46.0 41.8 52.1 46.6 
3 39.7 40.8 43.1 41.2 
4 29.0 32.3 41.5 34.3 
5 25.0 25.7 28.1 26.3 


groups and entire group. It will be noted that the upper group 
consistently defines a greater percentage of words correctly than 
the lower, middle, and entire groups. The middle group, with 
the exception of those words placed in category 2, defines a 
greater percentage of words than does the lower group. 

Of the words placed in category 1, words which the subjects 
said they would know under all circumstances, the lower group 
missed forty-three per cent; the middle group, forty per cent; the 
upper group thirty-two per cent, and the entire group missed 
thirty-eight per cent. In category 2, the lower group failed to 
define fifty-four per cent of the words they said they would know 
in context; the middle group, fifty-eight per cent; the upper 
group, forty-eight per cent; and the entire group fifty-three per 
cent. The lower group knew forty per cent of the words they 
placed in category 3; the middle group, forty-one per cent; the 
upper group, forty-three per cent; and the entire group defined 
correctly forty-one per cent of the words of whose meanings they 
were unsure. Of the words placed in category 4, words which 
the subjects recognized but said they could not define, the per- 
centage of correct definitions for the lower group was twenty- 
nine; for the middle group, thirty-two; upper group, forty-two; 
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and entire group thirty-four. A chance percentage of correct 
definitions was obtained for the words placed in category 5 by 
the subgroups and entire group. 

Table 1b presents the mean percentages of correct definitions 
for the four groups in the categories of the three-item key, The 


TABLE 18B.—MEAN PERCENTAGES OF CORRECT DEFINITIONS OF 
Worps PLAcep IN EACH OF THE CATEGORIES OF THE 
THREE-ITEM KEY BY THE SUBGROUPS AND ENTIRE 


GRouP 
Category Lower 25% Middle 50% Upper 25% Entire group 
Positive 54.4 56.6 64.2 58.0 
Uncertain 39.7 40.8 43.1 41.2 
Negative 26.9 29.0 31.7 29.2 


upper group consistently defines a greater percentage of words 
correctly than the lower, middle, and entire groups. The middle 
group assumes an intermediate position in each category with 
respect to the other two subgroups. 

Critical ratios computed between the mean percentages of 
correct definitions in each category of the five- and three-item 
keys for the three subgroups and entire group yield no significant 
differences between them in respect to mean percentages of 
correctly defined categorized words. 

The estimate scores of the subgroups and entire group are pre- 
sented in Table 2a. As a whole, the groups overestimate the 


TABLE 2A.—MEAN ESTIMATE ScorES, STANDARD ERRORS, AND 
RANGES OF THE THREE SUBGROUPS AND ENTIRE GROUP 


Estimate Lower Middle Upper Entire 
Scores Measures 25% 50% 25% Group 
Mean S.E. 18.1 1.7 17.4 1.2 9.8 1.7 15.7 .89 
Positive Range 0 to 43 -—6to38  -—13to28 -—13 to 43 
Mean S.E. —27.7 1.7 -—29.0 1.3 —21.71.4 —26.8 .90 
Negative Range —52to -10 -—6lto -2 -40to0 —61 to 0 


number of words they know in respect to both positive and nega- 
tive estimate scores. The lower group presents evidence of 
greater overestimation of vocabulary size than does the middle 
group: this group, in turn, presents a greater deviation than the 
upper group. The negative estimate scores indicate that the 
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greatest underestimation of unknown words occurs in the middle 
group, while both the lower and upper groups underestimate less. 
On inspection of the ranges of the estimate scores for the sub- 
groups, it will be noted that with rise in class standing the sub- 
jects tend to become more conservative in their estimations of 
vocabulary size. 

In Table 2b are presented the critical ratios obtaining between 
the estimate scores of the subgroups and entire group. At the 


TABLE 28.—CRITICAL RATIOS OBTAINING BETWEEN THE MEAN 
ESTIMATE SCORES OF THE THREE SUBGROUPS AND ENTIRE 
GROUP 


Critical Ratios 
Land M LandU LandE Mand U MandE Uand E 


Positive .39 3.46 1.26 3.64 1.14 3.06 
Negative .60 2.73 .46 3.80 1.38 3.07 


one per cent level, the differences between the upper group and 
the lower, middle, and entire groups are significant for both 
positive and negative estimate scores. In other words, while all 
groups tend to overestimate the number of words they knew, the 
upper group does so to a significantly smaller degree than the 
others. 

Correlations between semester raw scores and vocabulary test 
scores are presented in Table 3. The values of ‘t’ were com- 


TABLE 3.—CORRELATIONS OBTAINING BETWEEN SEMESTER RAW 
ScORES AND VOCABULARY TEST SCORES FOR THE SUBGROUPS 
AND ENTIRE GROUP 


Group r S.E. t 
Lower 25% .02 BY .12 
Middle 50% . 28 11 2.42 
Upper 25% .07 ae 41 
Entire Gr. .39 .07 5.03 


puted to test whether the obtained coefficients differed sig- 
nificantly from zero. The t-values yielded indicate that the 
correlations obtained for the lower and upper groups are not 
significantly different from zero, while the r-values of the middle 
and entire groups are significant at the two and one per cent 


levels. 
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CONCLUSIONS 


1) The null hypothesis of equality of mean percentages of 
correctly defined words placed in the categories of the three- and 
five-item keys by the subgroups and entire group is not untenable 
with the data presented. One would therefore fail to reject the 
null hypothesis. However, despite the fact that the differences 
obtaining between the values of the measure for the different 
groups are not statistically significant, there is a consistent 
trend present in the direction of greater awareness of vocabulary 
size with rise in class standing. 

2) The null hypothesis of equality of positive and negative 
estimate scores for the subgroups and entire group is rejected at 
the one per cent level. The upper group underestimates and 
overestimates to a significantly smaller degree than the lower, 
middle, and entire groups. Since awareness of vocabulary size 
has been defined as a function of the estimate scores, one may 
conclude that the upper group possesses a greater awareness of 
the size of its vocabulary than the other groups. The tendency 
of the middle group to manifest greater awareness of the size of 
its vocabulary than the lower group is present but not consistent. 

3) On the basis of t-values presented, the null hypothesis of 
zero correlation between semester raw scores and vocabulary 
test scores is rejected at the two and the one per cent levels for 
the middle and entire groups. There is a trend in the direction 
of larger vocabulary size with rise in class standing. 

4) In view of the data presented, one may conclude that as 
class standing rises, i.e., in rising from lower to middle to upper 
achievement groups in this group of subjects, greater awareness 
of vocabulary size is manifested. 


Stupy II: AWARENESS OF VOCABULARY SIZE AND 
Sex DIFFERENCES 


AIMS OF THIS STUDY 


Awareness of vocabulary size, defined as a function of the 
deviation between estimated and obtained scores in a testing 
situation, has been found to be possessed by students to a remark- 
ably small degree. It has been shown that, irrespective of class 
standing and vocabulary size, students in general tend to over- 
estimate the number of words they know. 
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It is the aim of this paper to determine whether this poor 
judgment can be explained on the basis of sex differences in esti- 
mation of vocabulary size. The null hypotheses to be tested 
are the following: 

1) There is no difference between males and females in mean 
percentages of correctly defined words placed in the categories 
of the three- and five-item keys of word familiarity. 

2) There is no difference between males and females in respect 
to overestimation and underestimation of vocabulary size. 


RESULTS AND DISCUSSION 
Mean percentages of correctly defined categorized words.—In 


Table 4a are presented the mean percentages of correctly defined 


TABLE 44.—MEAN PERCENTAGES OF CORRECT DEFINITIONS OF 
Worps PLACED IN THE CATEGORIES OF THE FIVE-ITEM KEY 
BY THE MALE, FEMALE, AND ENTIRE GROUPS 


Category Male Female Entire Group 
1 63.2 59.4 61.3 
2 48.5 44.9 46.7 
3 44.2 38.4 41.3 
4 36.4 31.8 34.1 
5 27.0 25.3 26.2 


words placed in the categories of the five-item key by the male, 
female, and entire groups. The male group consistently defines 
a greater percentage of words than the female and entire groups. 
The entire group assumes an intermediate position in each 
category with respect to the male and female groups. 

Of the words placed in category 1 of the five-item key, the males 
failed to define correctly thirty-seven per cent of the words, the 
females missed forty-one per cent, and the entire group missed 
thirty-nine per cent. The males failed to define fifty-two per 
cent of the words placed in category 2; the females, fifty-five per 
cent; and the entire group, fifty-three per cent. The males knew 
forty-four per cent of the words placed in category 3, the females 
knew thirty-eight per cent, and the entire group knew forty-one 
per cent. Of the words placed in category 4, the males knew 
thirty-six per cent; the females, thirty-two per cent; and the 
entire group, thirty-four per cent. A chance percentage of 
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correct definitions of those words placed in category 5 was 
obtained for each of the groups. 

Table 4b presents the mean percentages of correct definitions 
of words placed in the categories of the three-item key by the 


TABLE 48B.—MEAN PERCENTAGES OF CORRECTLY DEFINED 
Worps PLACED IN THE CATEGORIES OF THE THREE-ITEM 
KEY BY THE MALE, FEMALE, AND ENTIRE GROUPS 


Category Male Female Entire Group 
Positive 59.6 56.5 58.0 
Uncertain 44.2 38 .4 41.3 
Negative 31.0 27.6 29.2 


male, female, and entire groups. The males consistently define 
a greater percentage of words correctly than the entire group 
which, in turn, consistently defines a greater percentage of words 
than the female group. 

Critical ratios computed between the three groups for each 
category of the three- and five-item keys yield no significant 
differences either between the sexes or between the entire group 
and the sex groups. 

The estimate scores of the male, female, and entire groups are 
presented in Table 5. As a whole, the groups overestimate the 


TaBLE 5.—MEAN EstTIMATE Scores, STANDARD ERRORS, AND 
RANGES OF THE MALE, FEMALE, AND ENTIRE GROUPS 


Entire 
Score Measure Male Female Group 
Mean S.E. 16.4 1.2 15.0 1.3 15.7 .89 
Positive Range 15 to 43 —13to38 —13 to 43 
Mean 8.E. —27.41.3 —26.4 1.2 -—26.8 .90 
Negative Range —3to—-61 Oto —5l 0 to —61 


number of words they know in respect to both positive and nega- 
tive estimate scores. The female group consistently overesti- 
mates less than the male and entire groups. The male group 
consistently overestimates to a greater degree than the entire 
group. Critical ratios computed between the estimate scores of 
the three groups yield no significant differences in respect to sex 
differences in overestimation of vocabulary size. 
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CONCLUSIONS 


1) The null hypothesis of equality of mean percentages of 
correctly defined words placed in the categories of the three- and 
five-item keys by the male, female, and entire groups is not 
untenable with the data presented. Although the males define 
correctly a greater percentage of categorized words than the 
female and entire groups, the differences observed are not statisti- 
cally significant. 

2) The null hypothesis of equality of estimate scores of male 
and female groups is not rejected. The males overestimate the 
number of words they know and underestimate the number they 
do not know to a greater, although not statistically significant, 
degree than the female and entire groups. 

3) On the basis of the data presented, one must conclude that 
in this group of students, sex differences cannot account for the 
lack of awareness displayed by college students in respect to 
vocabulary size. 
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SOME COMMENTS CONCERNING THE ROLE OF 
EDUCATION IN THE “CREATION 
OF CREATION”? 


EDITH A. WEISSKOPF 


Department of Psychology 
Purdue University 


Most people would agree that it is one of the main aims of 
education to encourage the: development of creative abilities. 
Therefore, educators are forced to concern themselves with the 
following question: What are the conditions which enhance the 
formation of creative thought? The literature in psychology and 
related fields contains a considerable amount of information 
about creative thought and the conditions of its growth. This 
information is based on general impression, on the analysis of 
biographical data, on introspective statements of creative think- 
ers, or On experimental research. In spite of the diversity of its 
origin the literature shows a considerable amount of agreement 
as to some of the conditions enhancing creativity. Thus, one 
would expect educators to use techniques in their intellectual 
work similar to the techniques which are thought to create a 
fertile soil for the growth of new ideas. 

Strangely enough this is not often the case. It is the objective 
of this paper to discuss some specific aspects of the well-known 
fact that the mental attitude during intellectual work encouraged 
by our schools is in many respects diametrically opposed to the 
mental attitude which is thought to stimulate creativity.”:* 

Most educators in their oral or written admonitions to their 
pupils stress the importance of industry. Yet, if we scan through 
the literature on creativity, we find statements such as the follow- 
ing: “ . . . though without industry great intellectual work can 
not be done, yet mere industry may prevent creation’’'® (p. 149). 
Educators emphatically stress the first part of this statement 
when teaching students, but are, with few exceptions (e.g.?°), 
reluctant even to mention the second part. The constant activ- 
ity enforced by many educators does not give young people the 





' A paper presented at the Annual Meeting of the American Psychological 
Association, September, 1950. 
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leisure which is an essential prerequisite for intellectual or artistic 
creation.”:!8 

We teach our pupils the importance of regular study habits. 
Why do we ignore statements such as the following? ‘The 
advantage of regular habits during the student years is great; but 
the optimum point at which the curve of that advantage cuts the 
curve of the advantage of fresh initiative is different for those 
whose professional work will be intellectual origination, and for 
those of different powers and aims’’® (p. 297). If pupils would 
use the explicit admonitions received from their teachers as their 
only source of knowledge on study techniques, it would never 
occur to them that regular study habits could be anything but 
advantageous. 

We try to instill a critical attitude towards intellectual matters 
in our students, but Hutchinson quotes a scientist saying: 
“It is surprising how fertile the mind becomes when not 
interrupted, or restricted by criticism in the free and often 
fantastic expression of its ideas’’® (p. 32). 

Our school system stresses the importance of intellectual objec- 
tivity and the liabilities of one-sidedness. Thus,“ . . . students 
often start their careers with some intense enthusiasm, then 
learn in college how narrow and ‘immature’ it is, how much can 
be said on the other side, and how many other good ways of doing 
things exist. They gradually acquire a spirit of mellow tolerance 
and scholarly insight, but lose the creative impulse’’!! (p. 301). 

It is challenging to examine in this connection suggestions made 
by psychiatrists or clinical psychologists on the mental attitude 
which leads to optimal results in producing insight and under- 
standing with regard to their patients’ problems. Freud, for 
example, makes the following recommendations on treatment tech- 
niques: “‘ All conscious*exertion is to be withheld from the capac- 
ity for attention .. . ”?(p.324). ‘‘Onehassimply to listen and 
not to trouble to keep in mind anything in particular” (p. 325). 
“Tt is not a good thing to formulate a case scientifically while 
treatment is proceeding, to reconstruct its development, antici- 
pate its progress, and take notes from time to time of the condition 
at the moment, as scientific interest would require” (p. 326-7). 
“« | . . the most successful cases are those in which one proceeds, 
as it were, aimlessly, and allows oneself to be overtaken by any 
surprises, always presenting to them an open mind, free from 
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expectation” (p. 327). Reik also emphasizes the liabilities of 
continuous sharp attention and the benefits of inattentive states: 
“ |. . training in attention may succeed “too well,’’ so that the 
child is shut off from a whole flood of freely emerging associations 
and ideas”? (p. 165). ‘‘The concentration of attention involves 
a setting up of inhibitions, so that keen observation of particular 
things corresponds to the ignoring of others. Withdrawal of 
certain contents from consciousness is no less a part of attention 
than the appearance of others at the center. It is open to ques- 
tions which part is the more significant to the essence and action 
of attention, the illumination of particular objects, the bestowal 
of intensive interest upon them, or its diversion away from 
others” (p. 164). ‘‘Not until I cast off the customary restraint 
of voluntary attention was I able to get hold of the hidden psy- 
chical data. Not until I left the firm and broad high road did I 
reach the goal along side paths. The secret meaning escaped my 
conscious, active attention, and was not found until I had become 
‘inattentive’ in the popular meaning of the word, that is, until I 
gave myself up to unconscious ideas of the goal” (p. 169). 

If a student would show a mental attitude towards his school 
work similar to the one advocated by these clinicians, he would 
elicit the serious criticism of his teachers. The objection may be 
raised that intellectual work of the academic type is basically 
different from the achieving of insight into the personality prob- 
lems of a neurotic patient, and that it, therefore, requires a differ- 
ent mental attitude. Quite true. Yet the literature on creative 
thought suggests that Freud’s admonitions may prove useful in 
any situation where insight is required. Moreover, Maier and 
Luchins show that the arriving at solutions of various kinds of 
problems is facilitated by such ‘loosening’ instructions as ‘‘ Do 
not be a creature of habit and stay in a rut,’”’ ‘“‘Keep your mind 
open for new meanings,” ‘‘ The solution pattern appears suddenly. 
You cannot force it’’® (p. 147), or “Don’t be blind’’® (p. 2). 

Thus, educators may find it profitable to advocate a passive, 
inattentive attitude to their students not as a permanent condi- 
tion but as a stimulating technique during specific phases of their 
intellectual work. As Bateson states “ ... the advances in 
scientific thought come from a combination of loose and strict 
thinking, and this combination is the most precious tool of 
science’! (p. 55). 
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At this point I expect some readers to voice objections. ‘‘Indus- 
try, regular study habits, a critical, controlled attitude, are 
necessary attributes of successful intellectual work,” they will 
say. ‘‘The creative individuals who found that relaxation, 
irregular habits, and temporary abandonment of criticism and 
control enhance creativity have spent years of strenuous prepara- 
tion during which they exercised all the virtues advocated by 
traditional educators.’’ Quite true. Yet the question remains 
unanswered why, among the four stages of the creative process, 
namely, preparation, incubation, illumination, and verifica- 
tion*—7.12-16,18 we prepare children for the first and last stage only, 
and completely ignore the other two stages.'® Only one possible 
cause of this educational bias will be suggested in this paper. 

The process of insight and illumination appears to be directed 
by unconscious forces to a higher degree than the process of 
preparation and verification. Just as lightning represents the 
sudden explosive merging of positive and negative electricity, 
intellectual insight may represent the sudden merging of con- 
scious content accumulated during the stage of preparation with 
unconscious, repressed material.** Thus, it may be the denial of 
this repressed material which causes educators to deny the process 
of illumination. In other words, we do not want to accept the 
fact that intellectual activity takes part of its energy supply 
from the big reservoir of unacceptable impulses called the Id, 
that the material carefully and rationally accumulated during 
the period of preparation has to become imbued with Id-impulses 
in order to be brought to life. We falsely admonish our students 
to exert control in order to keep these impulses repressed instead 
of teaching them to make their impulses subservient to intel- 
lectual activity. 

Just as we citen deceive small children about the nature of 
biological creation, we deceive older children by pretending that 
intellectual creation is unemotional. Thus, by de-emotionalizing 
intellectuality and by keeping the ‘sweet secret’ of intellectual 
creation from youth, we suffocate intellectual creativity. 
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BOOK REVIEWS 


Cexia B. StTenDLER. Children of Brasstown. Urbana, Illinois: 
Bureau of Research and Service of the College of Education, 
University of Illinois, 1949, pp. 103. 


This monograph is primarily concerned with the awareness of 
‘social class’ and class symbols of first-, fourth-, sixth- and eighth- 
grade children in one school in a New England industrial town of 
15,000 inhabitants. 

The author identified four classes in the community ‘by inter- 
views with townspeople.’ Unfortunately, we are not told how 
many people were interviewed, how they were selected, nor which 
‘classes’ they represent. The one hundred seven children in the 
study were then rated by five adult judges as belonging to the 
upper middle, ‘white collar,’ or working class. An upper class 
category was not used by the judges since it was presumed that 
upper class children did not attend public schools. There seems 
to be a class bias in the ratings since none of the judges repre- 
sented the working class. Further, in discussing the ‘social 
classes’ the author sometimes refers to them as socio-economic 
categories and at other times as prestige categories. It is unfor- 
tunate that more rigorous procedures were not used in stratifying 
the community and in class placement of the children, for the 
remainder of the research is carefully designed. 

The children were interviewed in regard to their out-of-school 
activities and their friendship choices. They were asked to 
rate four sets of pictures representing class differentials in jobs, 
houses, clothing and recreation. In addition, they rated their 
own and classmates’ ‘social class’ position and answered a ‘guess 
who’ test. Some of the major substantive conclusions were: 
(1) the higher the grade in general the greater the awareness of 
class symbols; (2) in the lower grades, boys are more discerning 
in awareness of class symbols and in the higher grades girls seem 
to be more discerning; (3) the higher one’s class placement, the 
more conscious one is of class symbols; (4) out-of-school activities 
such as work activities and music and dancing lessons are con- 
ditioned by class placement of parents; (5) in-school choices are 
more democratic than out-school choices in friendship selections; 
(6) the higher the grade, the greater the awareness of class posi- 
tion of schoolmates and the greater the awareness of class as 
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revealed in future occupational choice. Miss Stendler also offers 
certain generalizations applying to all class levels and sets up 


four levels of awareness of class symbols. 


In the final chapter the author discusses the implications of her 
findings. She raises some trenchant questions regarding the 
unrealism of typical social studies programs in the primary grades 
and the proposed solutions of Allison Davis and others to the 
lower class child’s anxiety over loss of prestige. Educational 
psychologists and school administrators will find this chapter 


exceedingly provocative. 

This reviewer was somewhat dissatisfied with the omission of 
supporting tables and statistical analyses for certain conclusions. 
One would like to be able to judge for oneself, for example, the 
significance of the analysis of variance tests rather than accept 
the author’s judgments without supporting data that certain 
relationships are statistically significant (p. 53). NraL Gross 

University of Minnesota 


E. G. WiuuiaMs, Editor. Trends in Student Personnel Work. 
Minneapolis: University of Minnesota Press, 1949, pp. 417. 


This book presents forty-three papers by nearly as many 
authorities, read at the University of Minnesota in connection 
with a three-day meeting of ‘‘associates in personnel work”’ 
from various institutions of higher education. The papers 
present (1) an overview of student personnel work during the 
last quarter century, (2) some of the difficulties encountered 
along the way, and (3) points up current best practices, covering 
research on methods, tests, counseling techniques and therapeutic 
procedures. This book can be read with interest and profit 
both by students and workers. ‘The papers outline the growing 
edge of student personnel work.’”’ It should be particularly 
helpful to anyone—student or professor—wishing to gain 
acquaintance with the field both as to present status and future 
promise. 

The broad scope of the papers may be seen from the following 
listing of the main divisions of the book (contributors to each 
division are also here listed): Part I, The Réle of Personnel Work 
in Higher Education (M. M. Willey, W. H. Cowley, M. S. 
MacLean and H. Y. McClusky). Part II, Developments in 
Tests and Testing (J. G. Darley and D. B. Stuit). Part III, 
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Vocational Counseling and Occupational Adjustment (D. G. 
Paterson and C. L. Shartle). Part IV, Counseling Methods and 
Points of View (E. 8. Bordin, E. H. Porter, Jr.). Part V, Mental 
Hygiene (R. G. Hinckley, R. F. Berdie and Mrs. Anne Fenlason). 
Part VI, Problems of Student Veterans (R. J. Strom and §. J. 
Wenberg). Part VII, Special Personnel Services (F. G. Moore, 
Mrs. Dorothy Dyer, W. C. Stevens, Mrs. Margaret Aldrich, 
J. D. Foley, Bryng Bryngelson, G. B. Risty, Ruth E. Boynton), 
Part VIII, The Role of Student Housing in Education (B. J. 
Borreson, A. H. Drammen). Part IX, Personnel Work as a 
Profession (Esther Lloyd-Jones, C. G. Wrenn and J. G. Darley). 
Part X, Counseling by Faculty Members (D. D. Feder, W. M. 
Gilbert, D. J. Shank and J. L. Bergstresser). Part XI, Religious 
Programs and Student Personnel Work (EK. G. Williamson, Rey. 
James Boren, Rabbi Norman Frimer, Father Leonard Cowley). 
Part XII, Social Adjustment through Activities (W. H. Cowley, 
R. L. Sutherland). Part XIII, Personnel Problems of Women 
(Kate Hevner Mueller, Esther Lloyd-Jones). Part XIV, Testing 
and the Curriculum (C. P. Turner, R. W. Tyler). 
J. B. Stroup 
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